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Preface 


Graphs are usually represented as geometric objects drawn in the plane, con- 
sisting of nodes and curves connecting them. The main message of this book is that 
such a representation is not merely a help of visualizing the graph, but an impor- 
tant mathematical tool. It is obvious that this geometry is crucial in engineering, if 
you want to understand rigidity of frameworks and mobility of mechanisms. But 
even if there is no geometry directly connected to the graph-theoretic problem, a 
well-chosen geometric embedding has mathematical meaning and applications in 
proofs and algorithms. This thought has emerged in the 1970’s, and I found it 
quite fruitful: Among its first applications, it lead to a classification result in the 
theory of node-coverings, to the theory of orthogonal representations, and through 
this, to several combinatorial applications of semidefinite optimization. 

I have given quite a few courses and lectures about the fast expanding theory 
of geometric representations of graphs. In 2014, I gave an advanced course on 
this topic at the Eidgenossische Technische Hochschule in Zurich, and the interest 
and active participation of the audience, faculty, postdocs and graduate students 
provided great inspiration. I am most grateful for the hospitality of ETH. 

By that time, I had decided that it is worthwhile to merge my handouts and 
lecture notes about this general topic into a book. This took quite some time; 
partly, because other duties limited the time I could spend on this project, but 
also because I kept recognizing common concepts that extended through several 
seemingly unrelated topics, and holding, I hope, the material better together (for 
example, randomization, non-degeneracy and duality). Many new results and new 
applications of the topic have also been emerging, even outside mathematics, like in 
statistical and quantum physics and computer science (learning theory). At some 
point I had to decide to round things up and publish this book. 
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CHAPTER 1 


Why are Geometric Representations Interesting? 


To represent a graph geometrically is a natural goal in itself, since it provides 
visual access to the abstract structure of the graph. But, in addition, it is an 
important tool in the study of various graph properties, including their algorithmic 
aspects. To illustrate what this means, let us describe a few examples of increasing 
complexity. 


1.1. Edge coloring 


We start with a very simple application of a well-chosen geometric image. How 
many colors are needed to color the edges of a complete graph so that no two edges 
incident with the same node get the same color? The answer depends on the parity 
of the number of nodes n. If n is odd, then at most (n—1)/2 edges can use the 
same color, so we need at least 

n n—-1 
gy 
2 2 


colors. To show that this is sufficient, we consider the nodes as the vertices of a 
regular polygon. An edge and all diagonals parallel to it can be colored with the 
same color. Rotating this set about the center of the polygon, we get n such sets of 
edges, which together cover all edges of the graph. So they form an edge-coloring 
with n colors (Figure [1.1] left). 


FIGURE 1.1. Optimal edge coloring of a complete graph with an 
odd and even number of nodes. 


If n is odd, then the analogous computation gives a weaker lower bound of 


(5)/g=0- 


for the number of colors. Is this sufficient? To improve the construction by a similar 
use of rotations as for odd n, we step up to dimension 3. We represent the nodes 
as the vertices of a pyramid with an (n—1)-sided regular polygon as its base. For 
any edge e of the base, we can color the diagonals parallel to e as well as the edge 
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connecting its apex to the base vertex opposite to e with the same color. Rotating 
this set again about the axis of the pyramid, we get a coloring of the edges of K,, 
with n—1 colors (F igure [1.1] right). 

There are of course many other ways to solve this simple problem; but the 
above construction using geometry is perhaps the nicest. 


1.2. Disjoint paths 


Suppose that we have a 2-connected graph G and two specified nodes s and t. 
Two “requests” come in for two nodes x and y, and we have to find two disjoint 
paths connecting s and ¢ to x and y (it does not matter which of x and y will be 
connected to s). This can be computed by one of zillions of flow or connectivity 
algorithms in reasonable time. 

Now suppose that we have to compute such paths for many requests {x,y}. Do 
we have to repeat the computation each time? We can do much better if we use the 
following theorem: Given a 2-connected graph and two specified nodes s and t, we 
can order all nodes so that s is first, t ts last, and every other node v has a neighbor 
that comes earlier as well as a neighbor that comes later. Such an ordering is called 
an s-t numbering. It is not hard to construct such a numbering (Exercise [1.2). 

Once we know an s-t numbering, and a request {x,y} comes in, it is trivial to 
find two disjoint paths: Let (say) x precede y in the ordering, then we can move 
from « to an earlier neighbor x’, then to an even earlier neighbor x” of 2’ etc. until 
we reach s. Similarly, we can move from y to a later neighbor y’, then to an even 
later neighbor y” of y’ etc. until we reach t. This way we trace out two paths as 
requested. 

The ordering can be thought of as representing the nodes of G by points on the 
line, and the easy procedure to find the two paths uses this geometric representation. 


1.3. Hitting times 


In this book, we will discuss some properties of random walks on a graph (as 
far as they are related to geometric representations). We’ll also use the physical 
representation of a graph with the edges replaced by rubber bands. As a sampler, 
let us describe a very simple but nice connection. 

Let a be a node of a connected graph G. We start a walk at v° = a. We select 
one of the neighbors v! of v° at random (every neighbor has the same probability 
of being selected), and we move to v'. Then we move from v! to v? in the same 
way, etc. This way we get an infinite sequence of random nodes (v°,v',v?,...), 
which we call a random walk on G. 

Many important questions can be asked about a random walk (we are going 
to talk about some of them later). Perhaps the following is the simplest. Let b be 
another node of G. We define the hitting time H(a,b) as the expected number of 
steps before a random walk, started at a, will reach 6. (This number is known to 
be finite for a finite connected graph.) 

There are many interesting questions you can ask about hitting times. To begin 
with, what are the hitting times on particular graphs like paths, cycles, or trees? 

The following construction is very useful answering quite a few of these basic 
questions (of course, there are questions about hitting times whose answer requires 
a much more sophisticated approach). Consider the edges of the graph as rubber 
bands; these are ideal (or really high tech) rubber bands, contracting to zero length 
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when not stretched, and not getting tangled. Attach a weight of deg(v) to each 
node v. Nail the node 6 to the wall and let the graph find its equilibrium (Figure 


1.2). 


3} 1 


FIGURE 1.2. Hanging a tree, a path, and a 3-cube from a node. 
The graphs are horizontally distorted to show their structure. The 
stretching of the edges is easily computed using Hooke’s Law. 


The simple but useful fact about this construction is that each node v will be 
at a distance of H(v,b) below b. This is not hard to prove noticing that the hitting 
times to b and the distances from b satisfy the same equations (see Exercise 1.4). 

Using this geometric/physical picture, we can derive several interesting facts 
about hitting times. Applying the construction to a path with k nodes, we see 
that the k-th edge from the bottom is stretched to a length of 2k—1 (this is the 
total weight of the nodes below it). Hence the total length of the path will be 
143+---+(2n—3) = (n—1)?; this is the hitting time from one endpoint of the 
path to the other. Or, using the symmetries of the skeleton of the 3-dimensional 
cube, we can see that the lowest edges are stretched to length 1 (3 edges carry a 
weight of 3), the next layer, to length 2, and the edges incident to b, to length 7. 
So the hitting time from a node to the opposite node is 10. Exercises [I.5}{1.7] offer 
some further applications of this geometric construction. 


1.4. Shannon capacity 


The last two examples used just a single dimension; of course, one-dimensional 
geometry is not “really” geometry, and we better give a higher-dimensional example, 
which is substantially more involved. The following problem in information theory 
was raised by Claude Shannon, and it motivated the introduction of orthogonal 
representations |Lovasz 1979b] and several of the results to be discussed in this 
book. 

Consider a noisy channel through which we are sending messages composed 
of a finite alphabet V. There is an output alphabet U, and each v € V, when 
transmitted through the channel, can come out as any element in a set U, C U. 
Usually there is a probability distribution specified on each set U,, telling us the 
probability with which v produces a given u € U,, but for the problem we want to 
discuss, these probabilities do not matter. As a matter of fact, the output alphabet 
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will play no role either, except to tell us which pairs of input characters can be 
confused: those pairs (v, v’) for which U,NU, 4 9. 

One way to model the problem is as follows: We consider V as the set of nodes 
of a graph, and connect two of them by an edge if they can be confused. This 
way we obtain a graph G, which we call the confusion graph of the alphabet. The 
maximum number of non-confusable messages of length 1 is the maximum number 
of nonadjacent nodes (the maximum size of a stable set) in the graph G, which we 
denote by a(G). 

Now we consider longer messages, say of length k. We want to select as many 
of them as possible so that no two of them can possibly be confused. This means 
that for any two of these selected messages, there should be a position, where the 
two characters are not confusable. As we shall see, the number of words we can 
select grows as O* for some © > 1, which is called the Shannon zero-error capacity 
of the channel. 

A simple and natural way to create such a set of words is to pick a non- 
confusable subset of the alphabet, and use only those words composed from this 
set. So if we have a@ non-confusable characters in our alphabet, then we can create 
a® non-confusable messages of length k. But, as we shall see, making use of other 
characters in the alphabet we can create more! How much more, is the issue in this 
discussion. 

Let us look at two simple examples (Figure[1.3). 


Ww 


b d w ¢ 


FIGURE 1.3. Two confusion graphs. In the alphabet {p, q, b, d} 
two letters that are related by a reflection in a horizontal or ver- 
tical line are confusable, but not if they are related by two such 
reflection. The confusability graph of the alphabet {m,n, u,v, w} 
is only convincing a little in handwriting, but this graph plays an 
important role in this book. 


Example 1.1. Consider the simple alphabet (p,q,d,b), where the pairs {p,q}, 
{q,d}, {d,b} and {b,p} are confusable (Figure[1.3] left). We can just keep p and d 
(which are not confusable), which allows us to create 2" non-confusable messages 
of length k. On the other hand, if we use a word, then all the 2" words obtained 
from it by replacing some occurrences of p and q by the other, as well as some 
occurrences of b and d by the other, are excluded. Hence the number of messages 
we can use is at most 4*/2* = 2%, r) 


Example 1.2 (5-cycle). If we switch to alphabets with 5 characters, then we 
get a much more difficult problem. Let V = {m,n,u,v,w} be our alphabet, with 
confusable pairs {m, n}, {n, u}, {u, v}, {v, w} and {w, m} (Figure[1.3} right; we refer 
to this example as the “pentagon” ). Among any three characters there are two that 
can be confused, so we have only two non-confusable characters. Restricting the 
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alphabet to two such characters (say, m and v), we get 2" non-confusable messages 
of length k. 

But we can do better: the following 5 messages of length two are non-confusable: 
mm, nu, ww, vn and wv. This takes some checking: for example, mm and nu 
cannot be confused, because their second characters, m and u, cannot be confused. 
If k is even, then we can construct 5*/? non-confusable messages, by concatenating 
any k/2 of the above 5. This number grows like (V/5)* = 2.236" instead of 2*, a 
substantial gain! 4 


Can we do better by looking at longer messages (say, messages of length 10 or 
1000), and by some ad hoc method finding among them more that 5° non-confusable 
messages? We are going to show that we cannot, which means that the set of words 
composed of the above 5 messages of length 2 is optimal. 

The trick is to represent the alphabet in a different way. Let us assign to each 
character i € V a vector u; in some Euclidean space R@. If two characters are not 
confusable, then we represent them by orthogonal vectors. 

If a subset of characters S is non-confusable, then the vectors u; (¢ € S) are 
mutually orthogonal unit vectors, and hence for every unit vector c, 


So (ctui)? <1. 
ics 
Hence |.$| minjeg(c'u,)? < 1, or 
IS|< — : 
max ———~ < max ———... 
~ ies (clu;)? — ieV (clu,)? 
So if we find a representation u and a unit vector c for which the squared products 
(c'u;)? are all large (which means that the angels <(c, u;) are all small), then we 
get a good upper bound on |S]. 


ow 


a 
Ficure 1.4. An umbrella representing the pentagon. 


For the alphabet in Example (the pentagon), we use the 3-dimensional 
vectors in Figure [1.4] To describe these, consider an umbrella with 5 ribs of unit 
length. Open it up to the point when nonconsecutive ribs are orthogonal. This way 
we get 5 unit vectors U,,, Un, U,, Uy, U,,, assigned to the nodes of the pentagon so 
that each u; forms the same angle with the “handle” c and any two nonadjacent 
nodes are labeled with orthogonal vectors. With some effort, one can compute that 
(c'u;)? = 1/./5 for every i, and so we get that |S| < V5 for every non-confusable 
set S. Since |S| is an integer, this implies that |S] < 2. 

This is ridiculously much work to conclude that the 5-cycle does not contain 3 
nonadjacent nodes! But the vector representation is very useful for handling longer 
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messages. We define the tensor product of two vectors u = (u1,...,Un) € R” and 
Vv = (U1,---,Um) € R™ as the vector 

UoV = (U1U1,---, U1Um, U2V1,---, U2Um; +++; UnV1,--- Shieti €R™. 
It is easy to see that Juov| = |u||v|, and (more generally) if u,x € R” and 


v,y € R™, then (uov)'(xoy) = (u'x)(v'y). For a k > 1, if we represent a 


message 71 ...7,% by the vector u;,°0---ou,;,, then non-confusable messages will be 
represented by orthogonal vectors. Indeed, if 21...%, and j1...j, are not confus- 
able, then there is at least one subscript r for which i, and j, are not confusable, 
hence ul Uy, = 0, which implies that 

(uj, 0-- -ouj;,) (Uj, O-+-OU;,) = (uj,uj, ) bes (uj u;, ) =0. 
Using co---oc (k factors) as the “handle”, we get that for any set S of non- 
confusable messages of length k, 


1 1 
|S| << max = max = (V5)*. 
41s sb ((co -+-00)"(uj, O°: -oui,)) i1,-stk (cTuj, )?... (cK, )? 


So every set of non-confusable messages of length k has at most (V5)* elements. 
We have seen that this bound can be attained, at least for even k. Thus we have 
established that the Shannon zero-error capacity of the pentagon is \/5. 

We will return to this topic in Sections and where the zero-error 
capacity problem will be discussed for general confusability graphs, in classical and 
quantum information theory. 


1.5. Vector-labeled graphs or frameworks? 


One of the most basic objects we study in this book is a graph whose nodes are 
labeled by vectors from a euclidean space R¢. We can also think of these vectors 
as the positions of the nodes in R¢. A mapping u: V —> R¢@ can be thought of 
as a “drawing”, or “embedding”, or “geometric representation” of the set V in a 
Euclidean space. Most of the time, V will be the node set of a graph, and we 
should think of u; as the position of node 7; in this case, we think of the edges 
as straight line segments connecting the points in the corresponding position. The 
main point in this book is to relate geometric and graph-theoretic properties, so 
this way of visualizing is often very useful. On the other hand, all three phrases 
above are ambiguous, and we are going to use “vector labeling” in most of the 
formal statements. This is the computer science view: we have a graph and store 
additional information for each node (see Figure[1.5). A vector-labeled graph (G, u) 
will also be called a framework, motivated by the important topic of rigidity. 

The difference between “vector-labeled graphs” and “geometric representa- 
tions” is a bit more than just different usage of words. In computer science (indeed, 
in any area that uses and analyzes data), one considers large tables where each row 
corresponds to an item of some sort, and the numbers in the row represent differ- 
ent data (for example, age, weight, height, income of a person, or frequencies of a 
word in various types of documents). Often the set of rows has a network structure 
(say, people who know each other or words with similar meaning can be considered 
“adjacent” in these networks). 

Surprisingly, the geometry of the row vectors often contains important informa- 
tion about the data, and geometric manipulation can lead to a better handle on the 
data, even though these data had nothing to do with geometry. While obviously 
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(ss) (27) 


1 1 


FIGURE 1.5. Two ways of looking at a graph with vector labeling 
(not all vector labels are shown). 


closely related to our more mathematical material, we cannot treat in detail this 
emerging line of research. 

While this is not strictly true, most of the time we can distinguish two kinds of 
vectors: those which are “true” vectors (best to think of them as geometric objects, 
like point positions or velocities), and those which are assignments of numbers, like 
weighting the nodes or edges of a graph. “True” vectors will be denoted by boldface 
characters. Mathematically, there is no difference of course. 


Exercise 1.1. Prove that s-t numberings can be characterized as orderings of 
the nodes in which the nodes preceding any node induce a connected subgraph, 
and the same holds for the nodes following a given node. 


Exercise 1.2. Design an algorithm that constructs an s-t numbering of a graph 
in polynomial time. 

Exercise 1.3. Let us assign a real weight f(e) to every edge of a transitive 
tournament on n nodes. 

(a) For v € V, let P, be the set of all pairs (a,b) such that there is a directed 
path of length a leaving v along which the weights are monotone increasing, as 
well as a directed path of length b leaving v along which the weights are monotone 
non-increasing, starting with the same edge. Prove that the sets P, are different. 
(b) Prove that ifn < () , then either there is a directed path of length p+1 along 
which the weights are monotone increasing, or there is a directed path of length q+ 


1 along which the weights are monotone non-increasing. |Chvatal-Komlds 1971): 


Exercise 1.4. (a) Prove that the hitting times to a given node of a connected 
graph G satisfy the equations 

H(u,b) = 1+ Sw S> A(v,u), (u#d). 

vEN(w) 

(b) Prove that the distances to b in the construction in Section satisfy the 
same equations. (c) Prove that this implies that the hitting times to b are equal 
to the distances from b. 
Exercise 1.5. Compute the hitting time (a) between two nodes of a cycle of 
length n; (b) between two nodes of the skeleton of the dodecahedron; (c) from 
one vertex of the d-dimensional cube to the opposite one. 


Exercise 1.6. Let G be a tree, and let i7 € E. Let k be the number of nodes 
of G\ij in the component containing 7. Then the hitting time from i to 7 is the 
2k—-1. 

Exercise 1.7. Let G be a simple connected graph, and let 17 € FE. Then | H(i, 7) — 
H(j,i)| <n? —3n+3, and this can be attained. 
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Exercise 1.8. Prove that for every graph with at most 5 nodes, with the exception 
of the pentagon, its Shannon capacity equals to the maximum cardinality of a 
stable set (mutually nonadjacent nodes) of the graph. 


CHAPTER 2 


Planar Graphs 


In the first half of this book, we consider mostly planar graphs and their geomet- 
ric representations, mostly in the plane. We start with a survey of basic results on 
planar graphs. This chapter is of an introductory nature; we give very few proofs, 
and this material might go as well to the Appendix as well-known background. 
Still, planar graphs play a very important role, and some of the considerations here 
provide a good introduction into the methods in this book, so this chapter deserves 
to be part of the main material. We assume familiarity with basic graph theory; 
notation and terminology are summarized in Appendix [B| 


2.1. Graphs and maps 


2.1.1. Planarity and duality. A multigraph G is planar, if it can be drawn 
in the plane so that its edges are Jordan curves and they are disjoint except for 
their endnodes. A planar map is a planar multigraph with a fixed embedding. We 
also use this phrase to denote the image of this embedding, i.e., the subset of the 
plane which is the union of the set of points representing the nodes and the Jordan 
curves representing the edges. It is trivial that every subgraph of a planar graph is 
planar. It is slightly less trivial that contracting an edge to a single node preserves 
planarity. 

The complement of a map G decomposes into a finite number of arcwise con- 
nected pieces, which we call the countries of the planar map. (Often the countries 
are called “faces”, but we reserve the word “face” for the faces of polyhedra.) The 
set of its countries will be denoted V*, and often we use the notation f = |V*|. 
There is one special country, which is unbounded; the other countries are bounded. 
The unbounded country most often does not play any special role; we can make 
any other country p unbounded by applying an inversion with respect to a circle 
contained in p. 

A planar map is called a triangulation if every country is bounded by a triangle. 
Note that a triangulation may have parallel edges, but two parallel edges cannot 
bound a country. In every simple planar map G we can introduce new edges to 
turn all countries into triangles while keeping the graph simple. 

If the planar map is connected (which we are going to assume most of the time), 
then every country is homeomorphic to an open disk, except for the unbounded 
country, which is homeomorphic to an open ring. Each country has a boundary 
consisting of a cyclic sequence of edges. An edge can occur twice in the boundary 
sequence; this happens if and only if it is a cut-edge (isthmus) of the multigraph. 
If the multigraph is 2-edge-connected, then no boundary sequence contains a repe- 
tition, and if it is 2-node-connected, then every boundary is a (simple) cycle. The 
country also defines a cyclic sequence of nodes; a node my occur many times in this 
sequence. Each occurrence of a node in this sequence is called a corner. In other 
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words, a corner is defined by two edges of the boundary of the country, called the 
edges of the corner, which are consecutive in the cyclic order along the boundary, 
but also in the cyclic order of the edges around the node. These two edges are 
different except if the corner is a node of degree 1. 


FIGuRE 2.1. A planar map and its dual. 


Every connected planar map G has a dual map G* = (V*, E*) (Figure (2.1). 
As an abstract graph, this can be defined as the multigraph whose nodes are the 
countries of G, and if two countries share k edges, then we connect them in G* by 
k edges. So each edge e € E will correspond to a unique edge e* of G*, and vice 
versa; hence |E£*| = |E| = m. If the same country is incident with e from both 
sides, then e* is a loop. For a country p, we denote by deg(p) its degree in the dual 
map. This is the number of edges on its boundary, with the addition that if the 
boundary passes an edge twice, then this edge is counted twice in deg(v). 

This dual map has a natural drawing in the plane: in the interior of each 
country F' of G we select a point ve (which can be called its capital), and on each 
edge e € E we select a point ue (this will not be a node of G*, just an auxiliary 
point). We connect vr to the points u. for each edge on the boundary of F' by 
nonintersecting Jordan curves inside F’. If the boundary of F' goes through e twice 
(i.e., both sides of e belong to F’), then we connect up to ue by two curves, entering e 
from two sides. The two curves entering u,. form a single Jordan curve representing 
the edge e*. It is not hard to see that each country of G* will contain a unique 
node of G, and so (G*)* = G. Contracting an edge that is not a loop in a planar 
map corresponds to deleting it from the dual, and vice versa. 

Often we will need an orientation of G. Then each (oriented) edge e € E has a 
tailtl(e) € V, a head hd(e) € V, a right shore rs(e) € V*, and a left shore Is(e) € V*. 
There is a natural way to define an orientation of the dual map G* = (V*, E*), so 
that the dual edge e* € E* crosses the corresponding edge e € F from left to right. 
The dual of the dual of an oriented map is not the original map, but the original 
map with its orientation reversed. 

From a planar map, we can create further planar maps that are often useful. 
The lozenge map G° = (V°, E®) of a planar map G is defined by V° = VUV*, 
where we connect p € V to i € V* if p is a node of country i. So G° has an edge 
for every corner of G. We do not connect two nodes in V nor in V*, so G® is a 
bipartite graph. It is easy to see that G° is planar. 

The dual map of the lozenge map is also interesting. It is called the medial 
map of G, denoted by G"‘, and it can be described as follows: we subdivide each 
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edge by a new node; these will be the nodes of the medial map. For every corner 
of every country, we connect the two nodes on the two edges bordering this corner 
by a Jordan curve inside the country (Figure [2.2). The medial graph has two 
kinds of countries: those corresponding to nodes of G and those corresponding to 
countries of G. Thus the countries of G'S can be two-colored so that every edge 
separates countries of different colors. This of course is equivalent to saying that 
G° is bipartite. 


FIGURE 2.2. The dual map, the lozenge map and the medial map 
of a part of a planar map. 


We can avoid the complications of the exceptional unbounded country if we 
consider maps on the sphere. Every planar map can be projected onto the sphere 
by inverse stereographic projection (Figure [2.3] see also Appendix[C.4.1), and vice 
versa. Often this leads to simpler statements, since one does not need to distinguish 
an unbounded country. (On the other hand, it is easier to follow arguments in the 
plane.) 


FIGURE 2.3. Projecting a planar map onto the sphere by inverse 
stereographic projection. 


2.1.2. Euler’s Formula. We often need the following basic identity for con- 
nected planar maps, called Euler’s Formula (where n denotes the number of nodes, 
m denotes the number of edges, and f denotes the number of countries): 


(2.1) n—m-+f =2. 
This follows easily from the observation that if we take a spanning tree of a planar 
map, then those edges of the dual graph which correspond to edges outside this tree 


form a spanning tree of the dual graph (Figure |2.4). The spanning tree has n—1 
edges, the spanning tree of the dual has f—1 edges, whence (n—1)+(f—1) =m. 
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FIGURE 2.4. Spanning trees in two planar graphs, and the corre- 
sponding spanning trees in their duals. For the grid on the right, 
the broken edges emanating through the boundary go to the capital 
of the unbounded country, which is not shown. 


Let us mention some important consequences of Euler’s Formula. For a simple 
planar map, each country has at least 3 edges, and adding these up, we count every 
edges twice. So we have 2m > 3f = 3(m+2~—n), and rearranging, we get 


(2.2) m < 3n—6. 


If, in addition, the graph has no triangles, then a similar simple counting argument 
gives that 


(2.3) m <2n—A4. 
From these inequalities it is easy to derive the following fact. 


Proposition 2.1. Every simple planar graph has a node with degree at most 5, 
and every triangle-free simple planar graph has a node with degree at most 3. 


From (2.2) and (2.3) it follows immediately that the “Kuratowski graphs” Ks, 
and K3.3 ( be reas 2.5] 5) are not planar. This observation leads to the following 
characterization of planar graphs. 


Theorem 2.2 (Kuratowski’s Theorem). A graph G is embeddable in the plane 
if and only if it does not contain a subgraph homeomorphic to the complete graph 
Ks or the complete bipartite graph K3.3. 


K 
K, K K, 3,3 


FIGURE 2.5. The two Kuratowski graphs. The two drawings on 
the right show that both graphs can be drawn in the plane with a 
single crossing. 
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To show a further typical application of Euler’s Formula, we prove a lemma, 
which will be useful in several arguments later on. In a planar map whose edges 
are 2-colored, we call a node quiet if the edges of each color emanating from it are 
consecutive in the cyclic order of the embedding. 


Lemma 2.3. In every simple planar map whose edges are 2-colored there are at 
least two quiet nodes. 


Proof. We may assume that the graph is 3-connected: we can add edges to 
make it 3-connected, and color them arbitrarily; a quiet node in the extended graph 
is quiet in the original graph as well. We call a corner bicolored, if its two edges are 
differently colored. We are going to estimate the number N of bicolored corners 
and the number A of quiet nodes in two different ways. 

If a node v is not quiet, then the color of edges incident with it changes at least 
four times in their cyclic order according to the embedding. (In particular, every 
such node must have degree at least four.) This means that v is incident with at 
least 4 bicolored corners, nd hence 


(2.4) N >A(n—h). 


On the other hand, let f, be the number of countries with r edges on their boundary. 
The maximum number of bicolored corners of a country with r edges is 


r—1, ifr is odd, 
r, if r is even. 


Thus the number of bicolored corners is at most 


(2.5) N < 2fg+4fa+4fs5+6fe+6f7+... 
Using the trivial identities 
f=fatfatfst+... and 2m = 3f3+4fat+5fst+... 


we get by Euler’s formula 
N < 2f3t+4fat4fs+6fet6frt+---<2fs+4fat+6f5+8fe+... 
=4m—4f =4n-8. 
By (2.4), it follows that h > 2. 


2.1.3. 3-connected planar graphs. Among planar graphs, 3-connected pla- 
nar graphs are especially important. We start with a simple but useful fact. A cycle 
C in a graph G is called separating, if either it has a chord, or G\V(C) has at least 
two connected components. (We can think of considering every chord of C as a con- 
nected component of the point set G\C, where G and C are considered as subsets 
of the plane.) 


Proposition 2.4. In a 3-connected planar graph a cycle bounds a country if and 
only if it is not separating. 


Since to be separating is an intrinsic property of a cycle in the graph (i.e., it 
does not depend on the way the graph is embedded), this proposition implies: 


Corollary 2.5. Every simple 3-connected planar graph has an essentially unique 
embedding in the plane in the sense that the set of cycles that bound countries is 
uniquely determined. 
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Once we have identified these cycles, gluing a topological disk on each of them, 
we get a (topological) sphere with the graph embedded in it. 
The following related characterization of 3-connected planar graphs was proved 


n |Tutte 1963). 


Theorem 2.6. Let G be a 3-connected graph. Then every edge of G is contained in 
at least two nonseparating cycles. G is planar if and only if every edge is contained 
in exactly two nonseparating cycles. 


2.1.4. Planar separation. The following important theorem is the basis of 
many recursive algorithms for planar graphs |Lipton—Tarjan 1979}. 


Theorem 2.7 (Planar Separator Theorem). Every planar graph G on n nodes 
contains a set S CV such that |S| < /8n, and every connected component of G\ S$ 
has at most 2n/3 nodes. 


One can play with the two constants in the bounds on the size of S and the 
size of the connected components of G\S, and many variations and improvements 
have been found, which we don’t discuss here. We will describe a proof based on 
circle packing, with a weaker bound of 3n/4 on the sizes of the components, but 
stronger bound 2,/n on the size of any component of G\S, in Section The 
simple example of a k x k grid shows that (up to the constants) the theorem is best 


possible (Exercise|2.10). 


2.2. Straight line representation 


Planarity is defined in terms of embedding the graph in the plane in the topo- 
logical sense. How “nice” can we make this drawing? We’ll talk a lot about this 
question; but as a warm-up, let us answer the first question that comes to mind: do 
we need curves to represent the edges in the planar embedding of a graph, or can we 
get by using straight line segments only? It is clear that to make two parallel edges 
disjoint (up to their endpoints), at least one of them must bend. The Wagner—Fary 
Theorem shows that this is the only situation that forces 
bending. 


Theorem 2.8. Every simple planar map can be drawn in the plane with straight 
edges. 


Figure [2.6]illustrates the theorem. Since this theorem will follow from several 
of our later constructions, we do not give the simple induction proof here (which is 
described in many textbooks and internet sites). 


SOy 


FIGURE 2.6. Drawing a simple planar map with curvy and straight edges. 
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2.2.1. Skeletons of polytopes. Let P be a 3-polytope (see Section [C.1] for 
basic terms and facts about polytopes). 


Proposition 2.9. The skeleton of every 3-polytope is a 3-connected planar graph. 


We describe the simple proof, because this is our first example of how a geomet- 
ric representation implies a purely graph-theoretic property, namely 3-connectivity. 


Proof. Planarity of Gp can be proved by constructing an embedding called 
the Schlegel diagram of the polytope. Let F' be any facet of P, and let x be a point 
that is outside P but very close to an interior point of fF’; more precisely, assume 
that the plane © of F' separates x from P, but for every other facet F’, x is on the 
same side of the plane of F” as P. Let us project the skeleton of P from x to the 
plane &. Then we get an embedding of Gp in the plane (Figure [2.7). 


FIGURE 2.7. Projecting the skeleton of a polytope into one of the facets. 


To see that Gp is 3-connected, it suffices to show that for any four distinct 
nodes a,b, c,d there is a path from a to 6 which avoids c and d. 

If a,b,c,d are not coplanar, then let II be a plane that separates {a,b} from 
{c,d}. We can connect a and b by a polygon consisting of edges of P that stays on 
the same side of II as a and 8, and so avoids c and d. 

If a,b, c,d are coplanar, let II be a plane that contains them. One of the open 
halfspaces bounded by II contains at least one vertex of P. We can then connect 
a and b by a polygon consisting of edges of P that stays on this side of II (except 
for its endpoints a and b), and so avoids c and d. 


The converse of this last proposition is an important and much more difficult 


theorem |Steinitz 1922): 


Theorem 2.10 (Steinitz’s Theorem). A simple graph is isomorphic with the 
skeleton of a 3-polytope if and only if it is 3-connected and planar. 


Figure [2.3] illustrates the theorem. A bijection between the nodes of a simple 
graph G and the vertices of a convex polytope P in R® that gives an isomorphism 
between G and the skeleton of P is called a Steinitz representation of the graph 
G. We do not prove Steinitz’s Theorem right now, but constructions of represen- 
tations by polytopes (in fact with special properties) will follow from the material 
in sections [3.2] and [16.2| 

There may be many representations of a graph by a 3-polytope, but the set 
of these representations, up to reflection, has nice properties (among others, it 
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el DP 


FIGURE 2.8. Representing a 3-connected planar map by a polytope. 


is connected in the topological sense). In higher dimensions graphs that can be 
represented as skeletons of polytopes are not characterized, and the set of rep- 
resentations of one and the same graph can be very complicated. We refer to 
for an in-depth presentation of this problem. 

The construction of the planar embedding of Gp in the proof of Proposition[2.9] 
gives an embedding with straight edges. Therefore Steinitz’s Theorem also proves 
the Fary—Wagner theorem, at least for 3-connected graphs. It is easy to see that 
the general case can be reduced to this by adding new edges so as to make the 
graph 3-connected (see Exercise [2.1). 

Finally, we note that the Steinitz representation is also related to planar duality. 


Proposition 2.11. Let P be a convex polytope in R® with the origin in its interior, 
and let P* be its polar. Then the skeletons Gp are Gp are dual planar graphs. 


2.2.2. Voronoi and Delaunay diagrams. We define two related important 
constructions. Both of them can be generalized to any dimension, but we start in 
the plane, since this is the version leading to straight line embedded planar graphs. 

Let S C R? be a finite set. For every s € S we define the Voronoi cell V. 
consisting of those points v € V for which s € S is a point of S closest to v. It is 
easy to see that Voronoi cells are convex polygons, whose interiors are disjoint, and 
which cover the whole plane; some cells will be unbounded (Figure [2.9] left). The 
collection of Voronoi cells is the Voronoi diagram. Each cell contains a single point 
of S, which we call its capital. 

The intersection of two Voronoi cells is a convex subset of the perpendicular 
bisector of the segment connecting their capitals, and hence this intersection may 
be a semiline, a segment, a point, or the empty set. If it is a semiline or a segment, 
we call it an edge of the diagram. The intersection of three cells can be a single 
point or empty; if it is a single point, we call it a vertex. For every vertex x, the 
capitals of those cells containing it are at the same distance from x, and so they 
form a convex polygon, and the vertex is the center of the circumscribed circle of 
this polygon. 

This way, the Voronoi cells define a planar graph, together with a straight 
line embedding in the plane, with the caveat that we need to create a “vertex at 
infinity” , and think of each semiline edge as ending at this new vertex. We call this 
graph G the Voronoi graph defined by S. 

The dual graph G* of the Voronoi graph G, called the Delaunay graph on S, is 
also very interesting. Its nodes are the points in the starting set S, and two of these 
nodes u and v are connected by an edge if the two Voronoi cells share an edge. 
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FIGURE 2.9. Left: the Voronoi and Delaunay diagrams of a set of 
points in the plane, as duals of each other. (Two vertices of the 
Voronoi graph are not shown: a vertex way down, and the “vertex 
at infinity”. Right: the Delaunay graph and the circumscribed 
circles of its countries. 


The circle about any internal point y of this edge, going through the two nodes, 
contains no further point of S in its interior or boundary: indeed, the cell defined 
by such a point would contain y in its interior or on its boundary, so y could not 
be in the cell of u, or there would be a third cell containing y. 

This leads to an alternative definition of the Delaunay graph: we connect two 
points of S by an edge if and only if there is a circle through both of them that 
contains not third point of S$ in its interior or on its boundary. In particular, every 
edge of the convex hull of S is subdivided by the points of $ it contains, to give a 
path in G*. 

Circles around a vertex x of the Voronoi graph that go through the nearest 
points of S' contain three or more points of S. These points form a convex polygon, 
which is the country of the Delaunay graph corresponding to x under duality. So 
every bounded country of the Delaunay graph has a circumscribed circle. 


Remark 2.12. We can generalize these constructions to Euclidean spaces of any 
dimension. For every finite set S C R%, and every subset T C S, we say that T isa 
Delaunay cell if there is a point x € R? such that the set of points of S closest to x 
is exactly T. The Voronoi cell corresponding to this Delaunay cell is the set of all 
such points x. The dimensions of a Voronoi cell and of the corresponding Delaunay 
cell are inversely related: they sum to d+1. These cells have many nice geometric 
properties (for example, they are convex polyhedra; see Exercise [2.13). 

The definitions above can be generalized to any metric space (V,d), but we only 
need a simplified form in this book. We define the Voronoi partition induced by a 
finite set S C V as the partition that has a partition class V, for each s € S, and 
every point v € V is put in a class V, for which s € S is a point of S closest to v. 
Ties can be broken arbitrarily. The partition classes in the Voronoi partitions will 
not be so nice in general: see Figure [2.10] for the Euclidean and ¢; (“Manhattan”) 
distance in the plane. The construction is useful nevertheless. 


Exercise 2.1. Let G be a simple planar graph. Prove that you can add edges to 
G so that you make it 3-connected while keeping it simple and planar. 
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FIGURE 2.10. Voronoi cells of a finite point set in the plane in the 
Euclidean and in the Manhattan distance 


Exercise 2.2. (a) Prove Euler’s Formula. (b) Find a relationship between the 
numbers of nodes, edges and countries of a map in the projective plane and on 
the torus. 


Exercise 2.3. Let G be a planar map and H, a spanning subgraph of G. Let 
H* denote the spanning subgraph of the dual map G* consisting of those edges 
in E(G*) that are not crossed by any edge of H. Prove that the number of 
components in H and H™ are related by c(H*) = c(H)+|E(HA)|—|V(G)|+1. 


Exercise 2.4. Let G be a simple 3-connected planar map. Prove that we can 
assign each corner either to its country or to its node so that each country is 
assigned to 2 corners, two nodes are not assigned to any corner, and each of the 
remaining nodes is assigned to 2 corners. 


Exercise 2.5. Construct a 2-connected simple planar graph on n nodes which 
has an exponential (in n) number of different embeddings in the plane. (Recall: 
two embeddings are considered different if there is a cycle that is the boundary 
cycle of a country in one embedding but not in the other.) 


Exercise 2.6. Prove that in every simple graph embedded in the projective plane 
whose edges are 2-colored, there is at least one quiet node. No quiet node can be 
guaranteed on the torus. 


Exercise 2.7. (a) Let L be a finite set of lines in the plane, no two of which are 
parallel, and not all going through the same point. Prove that there is a point 
where exactly two lines in LE go through. 

(b) Let S' be a finite set of points in the plane, not all on a line. Prove that there 
is a line which goes through exactly two points in S. 


Exercise 2.8. Let G bea triangulation of the plane. (a) Prove that G has an edge 
which is contained in two triangles only. (b) Prove the Wagner—Fary Theorem|[2.8] 
based on (a). 


Exercise 2.9. (a) Let L be a finite set of lines in the plane, not going through 
the same point. Color these lines red and blue. Prove that there is a point where 
at least two lines in L intersect and all the lines through this point have the same 
color. 


(b) Let S be a finite set of points in the plane, not all on a line. Color these points 
red and blue. Prove that there is a line which goes through at least two points in 
S and all whose points have the same color. 


Exercise 2.10. If G is a kx k grid, then every set S of nodes separating G into 
parts smaller that (1—c)k? has at least /2ck nodes. 


Exercise 2.11. The skeleton of every d-polytope is d-connected. 
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Exercise 2.12. The skeleton of every d-polytope has a Kai1 minor 
Griinbaum—Motzkin 1963}). 


Exercise 2.13. Prove that Voronoi cells are convex polyhedra in every dimension, 
and every face of a Voronoi cell is itself a Voronoi cell. 


Exercise 2.14. Let S be a finite set in the x-y plane. Introduce a third coordinate 
z, consider the set $’ of points (2, y, x+y") in 3-space, where (x,y) € S, and 
the convex hull P = conv($’). Prove that projecting those facets of P to the x-y 
plane whose outer normal points “down”, you get exactly the bounded countries 
of the Delaunay graph on S. 
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CHAPTER 3 


Rubber Bands 


3.1. Energy, forces, and centers of gravity 


Let us start with an informal description. Let G be a connected graph. Replace 
the edges by ideal rubber bands (satisfying Hooke’s Law). Think of the nodes in 
a nonempty subset S C V as nailed to given positions in d-space (thinking of 
the plane, d = 2, will be enough for a while), but let the other nodes settle in 
equilibrium (Figure (3-1). (We are going to see that this equilibrium position is 
uniquely determined.) The nodes in S' will be called nailed, and the other nodes, 
free. We call this equilibrium position of the nodes the rubber band representation 
of G in R? extending the representation of the nailed nodes. We represent the edges 
by straight line segments (the rubber bands don’t get tangled). 


FIGURE 3.1. Rubber band representation of a planar graph and 
of the Petersen graph. 


To make this precise, let u; = (wi1,..., ia)’ € R® be the position of nodei € V. 
By definition, u; = U; is prescribed for 7 € S, but arbitrary for the remaining nodes. 
The energy of this representation is defined as 


(3.1) E(u) = SO juj—uyl? = So So (win —uje)?. 


We want to find the representation with minimum energy, subject to the boundary 
conditions: 


minimize €(u) 
(3.2) subject to u; = U; for alli € S. 


Note that while we use phrases like energy, the representation is defined in exact 
mathematical terms. 
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Lemma 3.1. The function €: R?&%\5) > R is strictly conver. 


Proof. In (3-1), every function (wiz —Ujk)? is convex, so € is convex. Further- 
more, moving along an (affine) line in R@*(Y\5), this function is strictly convex 
unless wiz — Uje Temains constant along this line. If this applies to each coordinate 
of each edge, then moving along the line means parallel translation of the vectors 
u;, which is impossible if at least one node is nailed. 


It is trivial that if any of the vectors u; tends to infinity, then E(u) tends to 
infinity (still assuming the boundary conditions hold, where S' is nonempty). 
Together with Lemma [3.1] this implies that the representation with minimum en- 
ergy is uniquely determined. If i € V\S, then for the representation minimizing 
the energy, the partial derivative of E(u) with respect to any coordinate of u; must 
be 0: 


(3.3) S> (ui-uj)=0 (iE V\S). 


JEN (2) 


We can rewrite this as 


(3.4) igo yy 


This equation means that every free node is placed in the center of gravity of its 
neighbors. Equation has a nice physical meaning: the rubber band connecting 
i and 7 pulls i with force u,;—u,, so states that the forces acting on i sum to 
0 (as they should at the equilibrium). It is easy to see that characterizes the 
equilibrium position. 

We will return to the 1-dimensional case of rubber band representations in 
Chapter [4] studying harmonic functions on a graph. In those terms, equation 
asserts that every coordinate function is harmonic at every free node. 

It will be useful to extend the rubber band construction to the case when the 
edges of G have arbitrary positive weights (or “strengths”). Let c;; > 0 denote the 
strength of the edge 17. We define the energy function of a representation u by 


(3.5) E(u) => y cij|u; — Uy; |?. 

ijCE 
The simple arguments above remain valid: €, is strictly convex if at least one 
node is nailed, there is a unique optimum, and for the optimal representation every 
i €V\S satisfies 


JEN (i) 
This can be rewritten as 


(3.7) cs 


= Cig Uj. 
EN (a) Cig jEN(i) 
Thus u; is no longer at the center of gravity of its neighbors, but it is still a convex 


combination of them with positive coefficients. In particular, it is in the relative 
interior of the convex hull of its neighbors. 
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3.2. Rubber bands, planarity and polytopes 


3.2.1. How to draw a graph? The rubber band method was first analyzed 
in [Tutte 1963]. In this classical paper he describes how to use “rubber bands” to 
draw a 3-connected planar graph with straight edges and convex countries. 

Let G be a 3-connected planar graph, and let po be any country of it. Let 
Co be the cycle bounding po. Let us nail the nodes of Co to the vertices of a 
convex polygon Po in the plane, in the appropriate cyclic order, and let the rest 
find its equilibrium. We draw the edges of G as straight line segments connecting 
the appropriate endpoints. Figure [3.2]shows the rubber band representation of the 
skeletons of the five platonic bodies. 


LAX AX AS 
DH) 


FIGURE 3.2. Rubber band representations of the skeletons of pla- 
tonic bodies 


By the above, we know that each node not on Cp is positioned at the center 
of gravity of its neighbors. Tutte’s main result about this representation is the 
following: 


Theorem 3.2. If G is a simple 3-connected planar graph, then every rubber band 
representation of G (with the nodes of a particular country po nailed to a convex 
polygon) gives a straight-line embedding of G in the plane. In addition, each country 
is a convex polygon. 


Proof. Let u: V — R? be this rubber band representation of G. Let @ be a 
line intersecting the interior of the polygon Po, and let Up, U; and Uz denote the 
sets of nodes of G mapped on £ and on the two (open) sides of @, respectively. The 
key to the proof is the following claim. 


Claim 1. The sets U, and Uz induce connected subgraphs of G. 


Let us prove this for U;. Clearly the nodes of the outer cycle pp in U; form 
a (nonempty) path P,;. We may assume that ¢ does not go through any node 
(by shifting it very little in the direction of U;) and that it is not parallel to any 
line connecting two distinct positions (by rotating it with a small angle). Let 
a € U,\V(Cpo), we show that it is connected to P, by a path in U,. Since ug is 
a convex combination of the positions of its neighbors, it must have a neighbor a, 
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such that ug, is in U; and at least as far away from @ as ug. By our assumption 
that @ is not parallel to any edge, either u,, is strictly farther from @ than ug, or 
Ug, = Ug. 

At this point, we have to deal with an annoying degeneracy. There may be 
several nodes represented by the same vector u, (later it will be shown that this does 
not occur). Consider all nodes represented by u,, and the connected component H 
containing a of the subgraph of G induced by these nodes. If H contains a nailed 
node, then it contains a path from a to P,, all in U,. Else, there must be an edge 
connecting a node a’ € V(#) to a node outside H (since G is connected). Since 
the system is in equilibrium, a’ must have a neighbor a; such that ug, is farther 
away from ¢ than ug = ug (here we use again that no edge is parallel to 2). Hence 
a, € Uj, and thus a is connected to a; by a path in Uj. 

Either a, is nailed (and we are done), or we can find a node az € U,; such that 
a, is connected to ag by a path in Uj, and ug, is farther from @ than ug,, etc. This 
way we get a path Q in G that starts at a, stays in Uj, and eventually must hit P,. 
This proves the claim (F igure [3.3). 


1 


FIGURE 3.3. Left: every line cuts a rubber band representation 
into connected parts. Right: Each node on a line must have neigh- 
bors on both sides of the line. 


Next, we exclude some possible degeneracies. (We are not assuming any more 
that no edge is parallel to @: this assumption could be made for the proof of Claim 
1 only.) 


Claim 2. Every node u € Up has neighbors both in U, and U3. 


This is trivial if u € V(Cp), so suppose that wu is a free node. If u has a neighbor 
in U,, then it must also have a neighbor in U2; this follows from the fact that u, is 
the center of gravity of the points representing its neighbors. So it suffices to prove 
that not all neighbors of u are contained in Up. 

Let T be the set of nodes wu € Up with N(u) C Up, and suppose that this 
set is nonempty. Consider a connected component H of G/T] (H may be a single 
node), and let S' be the set of neighbors of H outside H. Since V(H)US' C Up, the 
set V(H)US' cannot contain all nodes, and hence S$ is a cutset. Thus |S] > 3 by 
3-connectivity. 

Ifa € S, then a € Up by the definition of $, but a has a neighbor not in Ug, and 
so it has neighbors in both U,; and U2 by the argument above (see Figure [3.3). The 
set V(H) induces a connected graph by definition, and U; and U2 induce connected 
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subgraphs by Claim 1. So we can contract these sets to single nodes. These three 
nodes will be adjacent to all nodes in S. So we have contracted G to K3 3, which 
is a contradiction, since G is planar. This proves Claim 2. 


Claim 3. Every country has at most two nodes in Up. 


Suppose that a, b,c € Up are nodes of a country p. Clearly p 4 po. Let us create 
a new node d and connect it to a, b and c; the resulting graph G’ is still planar. 
On the other hand, the same argument as in the proof of Claim 1 (with V(H) =d 
and S = {a,b,c}) shows that G’ has a K3.3 minor, which is a contradiction. 


Claim 4. Let p and q be the two countries sharing an edge ab, where a,b € Uo. 
Then V (pi) \ {a,b} C Uy and V (pe) \ {a,b} C Ug (or the other way around). 


Suppose not, then p has a node c # a,b and q has a node d £ a,b such that 
(say) c,d € U,. (Note that c,d ¢ Up by Claim 3.) By Claim 1, there is a path 
P in U; connecting c and d (Figure [3.4). Claim 2 implies that both a and b have 
neighbors in U2, and again Claim 1, these can be connected by a path in U2. This 
yields a path P’ connecting a and b whose inner nodes are in Uz. By their definition, 
P and P’ are node-disjoint. But look at any planar embedding of G: the edge ab, 
together with the path P’, forms a Jordan curve that does not go through c and d, 
but separates them, so P cannot exist. 


FIGURE 3.4. Two adjacent countries having nodes on the same 
side of ¢ in the rubber band representation (left), and the suppos- 
edly disjoint paths in the planar embedding (right). 


Claim 5. The boundary of every country q is mapped onto a convex polygon Py. 


This is immediate from Claim 4, since no edge of a country, extended to a line, 
can intersect its interior. 


Claim 6. The interiors of the polygons P, (q # po) are disjoint. 


Let x be a point inside P,,, we want to show that it is covered by one Py only. 
Clearly we may assume that x is not on the image of any edge. Draw a line through 
x that does not go through the position of any node, and see how many times its 
points are covered by interiors of such polygons. As we enter P,,, this number 
is clearly 1. Claim 4 says that as the line crosses an edge, this number does not 
change. So x is covered exactly once. 

Now the proof is essentially finished. Suppose that the images of two edges have 
a common point (other than their common endpoints). Then two of the countries 
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incident with them would have a common interior point, which is a contradiction 
except if these countries are the same, and the two edges are consecutive edges of 
this country. 


Before going on, let’s make a couple of remarks about this drawing method. The 
key step, namely Claim 1, is very similar to a basic fact concerning convex polytopes, 
mentioned in Appendix [C1] that every hyperplane intersecting the interior of the 
polytope cuts the skeleton into connected parts. Note that the proof of Claim 1 
did not make use of the planarity of G (see Exercise [3.4] and also Section [16.5). 

Tutte’s method, as described above, is a very efficient procedure to find straight- 
line embeddings of 3-connected planar graphs in the plane. These embeddings look 
nice for many graphs (as our figures show), but they may have bad parts, like points 
getting too close. Figure [3.5] shows a simple situation in which positions of nodes 
get exponentially close to each other and to the midpoint of an edge. You may 
play with the edge weights, but finding a good weighting adds substantially to the 
algorithmic cost. 


FIGURE 3.5. The rubber band representation can lead to crowding 
of the nodes. Each node on the middle line will be placed at or 
below the center of gravity of the triangle formed by the lower edge 


and the node immediately above it. So the distance between the 
k-th node from the top and the lower edge decreases faster than 


a, 


A reasonable way to exclude nodes being positioned too close is to re- 
quire that their coordinates are integers. Then the question is, of course, 
how to minimize these coordinates. In which rectangles [0,a] x [0,b] can ev- 
ery planar graph on n nodes be squeezed in so that we still get a straight 
line embedding? It turns out that this can be achieved with a,b = O(n) 
Fraysseix—Pach—Pollack 1990) |Schnyder 1990], with several good bounds on the 
appropriate pairs (a,b) (Chrobak—Nakano 1998}. 

Representing edges by straight lines is not always the most important format 
of useful drawings. In some very important applications of planar embedding, 
most notably the design of integrated circuits (chips), the goal is to embed the 
graph in a grid, so that the nodes are drawn on gridpoints, and the edges are 
drawn as zig-zagging grid paths. (Of course, we must assume that no degree is 
larger than 4.) This way all nonzero distances between nodes and/or edges are 
automatically at least 1 (the edge-length of the grid). Besides trying to minimize 
the size or area of the grid in which the embedding lies, it is very natural (and 
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practically important) to minimize the number of bends. The good news is that 
this can be achieved within a very reasonable area (O(n”)) and with O(n) bends 
[Tamassia—Tollis 2014]. See the Handbook of Graph Drawing and 
Visualization for details. 


3.2.2. How to lift a graph? An old construction of Cremona and Maxwell 
can be used to “lift” Tutte’s rubber band representation to a Steinitz representation. 
We begin with analyzing the reverse procedure: projecting a convex polytope onto 
a face. This procedure is similar to the construction of the Schlegel diagram from 
Section but we consider orthogonal projection instead of projection from a 
point. 

Let P be a convex 3-polytope, let F' be one of its faces, and let © be the plane 
containing H. Suppose that for every vertex v of P, its orthogonal projection onto 
A is an interior point of F; we say that the polytope P is straight over the face F. 


Theorem 3.3. (a) Let P be a 3-polytope that is straight over its face F', and let 
G be the orthogonal projection of the skeleton of P onto the plane of F. Then we 
can assign positive strengths to the edges of G so that G will be the rubber band 
representation of the skeleton with the vertices of F' nailed. 


(b) Let G be a 3-connected planar graph, and let T be a triangular country of G, 
and let A be a triangle in a plane %. Then there is a convex polytope P in 3-space 
such that T is a face of P, and the orthogonal projection of P onto the plane & 
gives the rubber band representation of G obtained by nailing T to A. 


In other words (b) says that we can assign a number 7; € R to each node i € V 
such that n; = 0 fori € V(T), 7, > 0 fori €e V\V(Z), and the mapping 


u; Ui1 
inw=(%) = Ui2 eR? 
Mi 


is a Steinitz representation of G. 


Example 3.4 (Triangular Prism). Consider the rubber band representation of 
a triangular prism (or of a triangular frustum, a truncated tetrahedron) in Figure 
left. If this is an orthogonal projection of a convex polyhedron, then the lines 
of the three edges pass through one point: the point of intersection of the planes of 
the three quadrangular faces. It is easy to see that this condition is necessary and 
sufficient for the picture to be a projection of a triangular frustum. To see that it is 
satisfied by a rubber band representation, it suffices to note that the inner triangle 
is in equilibrium, and this implies that the lines of action of the forces acting on it 
must pass through one point. 4 


Now we are ready to prove theorem |3.3 


Proof. (a) Let’s call the plane of the face F “horizontal”, spanned by the 
first two coordinate axes, and the third coordinate direction “vertical”, so that the 
polytope is “above” the plane of F’. For each face p, let gp be a normal vector. Since 
no face is vertical, no normal vector g, is horizontal, and hence we can normalize 
g, so that its third coordinate is 1. Clearly for each face p, g, will be an outer 
normal, except for p = F’, when g, is an inner normal (Figure 8.6). 
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FIGURE 3.6. Left: The rubber band representation of a triangular 
prism is the projection of a polytope. Right: Vertical projection 
of a polytope into one of its faces. 


Write g, = gee Let ij be any edge of G, and let p and q be the two countries 


on the left and right of 17. Then 
(3.8) (hy —hy)"(ui— uj) =0. 


Indeed, both g, and gy are orthogonal to the edge viv; of the polytope, and 
therefore so is their difference, and 


T hy —hy : u;— Uj T 
(h, —h,) (uj—uj) = 0 aa = (p—&q) (Vi—Vj) = 0. 
m5 
We have hr = 0, since the face T is horizontal. 


Let R denote the counterclockwise rotation in the plane by 90°, then it follows 
that h, —h, is parallel to R(u; —u,), and so there are real numbers c;; such that 


We claim that c,; > 0. Let k be any node on the boundary of p different from 7 
and j. Then ux is to the left from the edge 77, and hence 

(3.10) (uj, —u;) "R(u; — uj) > 0. 

Going up to the space, convexity implies that the point v; is below the plane of 
the face g, and hence fiVk < gIVi- Since EhVk = BhVi, this implies that 


- 
(3.11) «,; (R(u; - u;)) (u,—u,;) = (h,— h,) ‘(ux —u;) = (gp— Bq) (Vi —v;)>0. 
Comparing with (3.10), we see that c; > 0. 

To complete the proof of (a), we argue that the projection of the skeleton is 
indeed a rubber band embedding with strengths c;;, with F’ nailed. We want to 
prove that every free node 2 is in equilibrium, i.e., 


(3.12) S- ej (uj — uj) = 0. 
JEN (i) 
Using the definition of c;;, it suffices to prove that 
SS cijR(uj—ui) = S > (bp, —hy,) =0, 
jEN(i) jEN(i) 
where p; is the face to the left and q; is the face to the right of the edge 77. But this 
is clear, since every term occurs once with positive and once with negative sign. 


Let us make a remark here that will be needed later. Using that not only 
Ep — &q, but also gy, is orthogonal to v;—v;, we get that 


0= gn(vj —Vi)= h}(u, — uj) +15 — Ni; 
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and hence 


(b) The proof consists of going through the steps of the proof of part (a) in 
reverse order: given the Tutte representation, we first reconstruct the vectors h, so 
that all equations (3.8) are satisfied, then using these, we reconstruct the numbers 
7 So that equations (3.13) are satisfied. It will not be hard to verify then that we 
get a Steinitz representation. 

We need a little preparation to deal with edges on the boundary triangle. Recall 
that we can think of F;; = c;;(u;—u;) as the force with which the edge ij pulls its 
endpoint 7. Equilibrium means that for every free node 2, 


(3.14) Se 
JEN (A) 
This does not hold for the nailed nodes, but we can modify the definition of Fj; 
along the three boundary edges so that Fj; remains parallel to the edge u; — u; and 
(3.14) will hold for all nodes (this is the only point where we use that the outer 
country is a triangle). The only complication is that if we write Fj; = c;;(uj — uz) 
for the boundary edges, then we have to allow negative strengths c;;; this will cause 
not trouble in the computations. The existence of such forces F;; is natural by a 
physical argument: let us replace the outer edges by rigid bars, and remove the 
nails. The whole framework will remain in equilibrium, so appropriate forces must 
act in the edges ab, bc and ac to keep balance. To translate this to mathematics, 
one has to work a little; this is left to the reader as Exercise [3.5] 
We claim that we can choose vectors h, for all countries p so that 


(3.15) h, —h, = RF;; 


if 77 is any edge and p and q are the two countries on its left and right. This follows 
by a “potential argument”, which will be used several times in our book. Starting 
with hr = 0, and moving from country to adjacent country, equation will 
determine the value of h, for every country p. What we have to show is that we 
do not run into contradiction, i.e., if we get to the same country p in two different 
ways, then we get the same vector h,. This is equivalent to saying that if we walk 
around a closed cycle of countries, then the total change in the vector hy, is zero. It 
suffices to verify this when we move around countries incident with a single node. 
In this case, the condition boils down to 


S) BFg=0, 
iEN(j) 
which follows by (3.14). This proves that the vectors h, are well defined. 

Second, we construct numbers 7; satisfying by a similar argument (just 
working on the dual graph). We set 7; = 0 if 7 is a node of the unbounded country. 
Equation tells us what the value at one endpoint of an edge must be, if we 
have it for the other endpoint. 

One complication is that gives two conditions for each difference 7; —7;, 
depending on which country incident with it we choose. But if p and q are the two 
countries incident with the edge ij, then 


h}(u, —uj)—hj(u; —u;) = (h, —h,)'(u; —u,) = (RFi;) "(uj —u,) = 0, 
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since F;; is parallel to u;—u,; and so RF;; is orthogonal to it. Thus the two 
conditions on the difference n;—7; are the same. 

As before, equation determines the values 7;, starting with ng = 0. To 
prove that it does not lead to a contradiction, it suffices to prove that the sum 
of changes is 0 if we walk around a country p. In other words, if C is the cycle 
bounding a country p (oriented, say, clockwise), then 


ij €B(C) 
which is clear. It is also clear that 7m, = 7, = 0. 

Now define v; = c for every node i, and gp, = (»?) for every country p. It 
remains to prove that 7 ++ v; maps the nodes of G onto the vertices of a convex 
polytope, so that edges go to edges and countries go to facets. We start with 
observing that if p is a country and 27 is an edge of p, then 


€,¥i—S,Vz = hy(u;— uy) +(m—ny) =0, 
and hence there is a scalar a, so that all nodes of p are mapped onto the hyperplane 


gyx =p. We know that the image of p under 7 ++ u; is a convex polygon, and so 
the same follows for the map 71> vj. 


Ip'X=Ap 


FIGURE 3.7. Lifting a rubber band representation to a polytope. 


To conclude, it suffices to prove that if 77 is any edge, then the two convex 
polygons obtained as images of countries incident with 77 “bend” in the right way; 
more exactly, let p and q be the two countries incident with 77, and let Q, and Q, 
be two corresponding convex polygons (see Figure 8.7). We claim that Q, lies on 
the same side of the plane Spx = dp as the bottom face. Let vz be any vertex of the 
polygon Q, different from v; and v;. We want to show that SpVk < Qp. Indeed, 


BVk Ay = SVi i S.Vi a Sh (VK —Vvi) = (fp— Sq) "(VE =) 


(since both v; and v; lie on the plane ae =a), 


0 


(since u, lies to the right from the edge u;u,). 


es 
= ue (vi —Vvi) = (hp —hy) "(uy — uy) = (RFiz) "(ux — ui) < 0 
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Theorem [3.3] proves Steinitz’s theorem in the case when the graph has a trian- 
gular country. If this is not the case, then Proposition [2.1] implies that the graph 
has a node of degree 3, which means that the dual graph has a triangular coun- 
try. So we can represent the dual graph as the skeleton of a 3-polytope. Choosing 
the origin in the interior of this polytope, and considering its polar, we obtain a 
representation of the original graph. 


3.3. Rubber bands and connectivity 


Rubber band representations can be related to graph connectivity, and can be 
used to give a test for k-connectivity of a graph (Linial-Lovasz—Wigderson 1988). 
It turns out that connectivity of the underlying graph is related to the degeneracy 
of the representation—a notion various versions of which will be crucial in many 
parts of this book. 


3.3.1. Degeneracy: essential and nonessential. We start with a discus- 
sion of what causes degeneracy in rubber band embeddings. Consider the two 
graphs in Figure It is clear that if we nail the nodes on the convex hull, and 
then let the rest find its equilibrium, then there will be a degeneracy: the grey 
nodes will all move to the same position. However, the reasons for this degeneracy 
are different: In the first case, it is due to symmetry; in the second, it is due to 
the node that separates the grey nodes from the rest, and thereby pulls them onto 
itself. 

One can distinguish the two kinds of degeneracy in another way: In the first 
graph, the strengths of the rubber bands must be strictly equal; varying these 
strengths it is easy to break the symmetry and thereby get rid of the degeneracy. 
However, in the second graph, no matter how we change the strengths of the rub- 
ber bands (as long as they remain positive), the grey nodes will always be pulled 
together into one point. 


AL AN 


FIGURE 3.8. Two reasons why two or more nodes end up on top 
of each other: symmetry, or a separating node 


Figure [3.9] illustrates a bit more delicate degeneracy. In all three pictures, the 
grey points end up collinear in the rubber band embedding. In the first graph, the 
reason is symmetry again. In the second, there is a lot of symmetry, but it does 
not explain why the three grey nodes are collinear in the equilibrium. (It is not 
hard to argue though that they are collinear: a good exercise!) In the third graph 
(which is not drawn in its equilibrium position, but before it) there are two nodes 
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separating the grey nodes from the nailed nodes, and the grey nodes will end up 
on the segment connecting these two nodes. In the first two cases, changing the 
strength of the rubber bands will pull the grey nodes off the line; in the third, this 
does not happen. 


FIGURE 3.9. Three reasons why three nodes can end up collinear: 
symmetry, just accident, or a separating pair of nodes 


3.3.2. Connectivity and degeneracy. Our goal is to prove that essential 
degeneracy in a rubber band embedding is always due to low connectivity. We 
start with the easy direction of the connection, relating connectivity and affine 
rank, formalizing the examples from the previous section. 


Lemma 3.5. Let G be a graph and S,T CV. Then for every rubber band repre- 
sentation u of G with S nailed, rkag(u(T)) < «(S,T). 


Proof. There is a subset U C V with |U| = «(S,T) such that V\U contains 
no (S,7T)-paths. Let W be the union of connected components of G\U containing 
a vertex from TJ. Then u, restricted to W, gives a rubber band representation of 
G[W] with boundary U. Clearly u(W) C conv(u(U)), and so 


rkag(u(Z')) < rkag(u(W)) = rkag(u(U)) < |U| = «(S,T). 


The Lemma gives a lower bound on the connectivity between two sets S' and 
T. The following theorem asserts that if we take the best convex representation, 
this lower bound is tight: 


Theorem 3.6. Let G be a graph and $,T C V with K(S,T) =d+1. Then G has a 
rubber band representation in R%, with suitable rubber band strengths, with S nailed 
such that rkag(u(T)) =d+1. 


This theorem has a couple of consequences about connectivity not between sets, 
but between a set and any node, and between any two nodes. 


Corollary 3.7. Let G be a graph,d >1 and S CV. Then G has a rubber band 
representation in general position in R@ with S nailed if and only if no node of G 
can be separated from S by fewer than d+1 nodes. 


Corollary 3.8. A graph G is k-connected if and only if for every S C V with 
|S| = k, G has a rubber band representation in R*—! in general position with S 
nailed. 


To prove Theorem [3.6] we choose generic edge weights. 
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FIGURE 3.10. Three nodes accidentally on a line. Strengthening 
the edges along the three paths pulls them apart. 


Theorem 3.9. Let G be a graph and S,T C V with K(S,T) > d+1. Choose 
algebraically independent edgeweights c,; > 0. Map the nodes of S into R?¢ in general 
position. Then the rubber band extension of this map satisfies rkag(u(T)) =d+l. 


If the edgeweights are chosen randomly, independently and uniformly from 
[0, 1], then the algebraic independence condition is satisfied with probability 1. 


Proof. The proof will consist of two steps: first, we show that there is some 
choice of the edgeweights for which the conclusion holds; then we use this to prove 
that the conclusion holds for every generic choice of edge-weights. 

For the first step, we use that by Menger’s Theorem|B.2| there are d+1 disjoint 
paths P; connecting a node i € S with a node i’ € T. The idea is to make the 
rubber bands on these paths very strong (while keeping the strengths of the other 
edges fixed). Then these paths will pull each node 7’ very close to i. Since the 
positions of the nodes 7 € S are affine independent, so will be the positions of the 
nodes i’ (Figure [3.10). 

To make this precise, let D be the diameter of the set {u(S)}. Fix any R > 0, 
and define strengths c,;; of the rubber bands by 


jee R, if ij is an edge of one of the paths P,., 
aa 1, otherwise. 


Let u be the rubber band extension of the given mapping of the nodes of S with 
these strengths. 

Recall that f minimizes the potential €, (defined by (3-5)) over all representa- 
tions of G with the given nodes nailed. Let v be the representation with vj; = uj 
if j € P; (in particular, v; = u, if 2 € S$); for any node 7 not on any Fj, let v; be 
any point in conv{u(S)}. In the representation v the edges with strength R have 
0 length, and so 

Es(u) < Es(v) < D?m. 
On the other hand, for every edge uv on one of the paths P;, 
€s(u) > Riu,—u,|’, 


and hence 
Dm 
VR 


|u,, — U4 | < 
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So these edges can be made arbitrarily short if we choose R large enough, and 
hence every node 7’ will be arbitrarily close to the corresponding nailed node i. 
Since {u; : i € S} are affine independent, this implies that the nodes in {uj : 7 € 
T} ={u;: 7 € S} are affine independent for a large enough value of R. 

This completes the first step. Now we argue that this holds for all possible 
choices of generic edge-weights. To prove this, we only need some general consider- 
ations. The embedding minimizing the energy is unique, and so it can be computed 
from the equations (say, by Cramer’s Rule). What is important from this is 
that the vectors u; can be expressed as rational functions of the edgeweights. Fur- 
thermore, the value det ((1 +uju, ye j=0) is a polynomial in the coordinates of the 
u,;, and so it is also a rational function of the edgeweights. We know that this 
rational function is not identically 0; hence it follows that it is nonzero for every 
algebraically independent substitution. 


Remark 3.10. Instead of rubber band embeddings, it would suffice to assume that 


each free node is placed in the convex hull of its neighbors |Cheriyan—Reif 1992). 


3.3.3. Rubber bands and connectivity testing. Rubber band represen- 
tations yield a (randomized) graph connectivity algorithm with good running time 
[Linial-Lovasz—Wigderson 1988]; however, a number of algorithmic ideas are needed 
to make it work, which we describe in their simplest form. We refer to the paper 
for a more thorough analysis. 


Connectivity between given sets. We start with describing a test checking 
whether or not two given k-tuples S and 7 of nodes are connected by k node- 
disjoint paths. For this, we assign random weights c;; to the edges, and map the 
nodes in S$ into R*~! in general position. Then we compute the rubber band 
representation extending this map, and check whether the points representing T 
are in general position. 

This procedure requires solving a system of linear equations, which is easy, but 
the system is quite large, and it depends on the choice of S and T. With a little 
work, we can save quite a lot. 

Let G be a connected graph on V = [n] and S C V, say S = [k]. Given a map 
u: S — R*-!, we can compute its rubber band extension by solving the system 
of linear equations 


(3.16) S> c(us—uj)=0 (EV\S). 

gEN(i) 
This system has (n—k)(k—1) unknowns (the coordinates of nodes i € S are con- 
sidered as constants) and the same number of equations, and we know that it has a 
unique solution, since this is where the gradient of a strictly convex function (which 
tends to co at oo) vanishes. 

At the first sight, solving takes inverting an (n—k)(k—1) x (n—k)(k—1) 
matrix, since there are (n—k)(k—1) unknowns and (n—k)(k—1) equations. How- 
ever, we can immediately see that the coordinates can be computed independently, 
and since they satisfy the same equations except for the right hand side, it suffices 
to invert the matrix of the system once. 

Below, we will face the task of computing several rubber band representations 
of the same graph, changing only the nailed set S. Can we make use of some of the 
computation done for one of these representations when computing the others? 
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The answer is yes. First, we do something which seems to make things worse: 
We create new “equilibrium” equations for the nailed nodes, introducing new vari- 
ables for the forces that act on the nails. Let f; = EN (s) Cij(uj; —u;) be the force 
acting on node 7 in the equilibrium position. Note that Ye; = f; = 0 for i ¢ S. Let 
X and Y be the matrices of the vector labelings u and f. Let DL. be the symmetric 
VxV matrix 


Cj if WE EB, 
(Le)ig = 4 Unena) Ciky t= J, 
0, otherwise 


(the weighted Laplacian of the graph). Then we can write (3.16) as XL. = Y. 
To simplify matters a little, let us assume that the center of gravity of the node 
positions is 0, so that X J = 0. We also have YJ = XL,J = 0. It is easy to check 
that 


(3.17) X=Y(L.+J)"" 


satisfies these equations (for more about this, see the “inverting the Laplacian” 
trick in Section [4.3.2). 

It is not clear that we are nearer our goal, since how do we know the matrix 
Y (ie., the forces acting on the nails)? But the trick is this: we can prescribe 
these forces arbitrarily, as long as the conditions }),., f; = 0 and f; = 0 for 7 ¢ S$ 
are satisfied. Then the rows of X will give some position for each node, for which 
XL, = Y. So in particular the nodes in V\S will be in equilibrium, so if we nail 
the nodes of S, the rest will be in equilibrium. If Y has rank d, then so does X, 
which implies that the vectors representing S will span the space R¢. Since their 
center of gravity is 0, it follows that they are not on one hyperplane. We can 
apply an affine transformation to move the points u; (¢ € S$) to any other affine 
independent position if we wish, but this is irrelevant: what we want is to check 
whether the nodes of T are in general position, and this is not altered by any affine 
transformation. 

To sum up, to check whether the graph is k-connected between S and T (where 
S,T CV, |S| = |T| = k), we select random weights c;; for the edges, select a 
convenient matrix Y, compute the matrix X = Y(Z.+J)~+, and check whether 
the rows with indices from T are affine independent. The matrix L,+ J is positive 
definite, and it has to be inverted only once, independently of which sets are we 
testing for connectivity. The matrix Y has dimensions (k—1) xn, but most of its 
elements are zero; a convenient choice for Y is 


1 0... O -1 O... 

Of «. O —-1 OQ... 
Y=]. . 

0 0... 1 -1 Oz... 


and then the matrix multiplication Y(L.+J)~+ is trivial. 


Connectivity between all pairs. If we want to apply the previous algorithm for 
connectivity testing, it seems that we have to apply it for all pairs of k-sets. Even 
though we can use the same edgeweights and we only need to invert L.+J once, 
we have to compute X = Y(L,+J)~! for potentially exponentially many different 
sets S, and then we have to test whether the nodes of T are represented in general 
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position for exponentially many sets T. The following lemma shows how to get 
around this. 


Lemma 3.11. For every verter vu € V we select an arbitrary k-subset S(v) C N(v). 
Then G is k-connected if and only if S(u) and S(v) are connected by k node-disjoint 
paths for every u and v. 


Proof. The ” only if” part follows from the well-known property of k-connected 
graphs that any two k-subsets are connected by k node-disjoint paths. The ”if” part 
follows from the observation that if S(u) and S(v) are connected by k node-disjoint 
paths, then u and v are connected by k openly disjoint paths. 


Thus the subroutine in the first part needs be called only O(n?) times. In fact, 
we do not even have to check this for every pair (u,v), and further savings can be 
achieved through randomization, using Exercises and 


Numerical issues. The computation of the rubber band representation requires 
solving a system of linear equations. We have seen (Figure[3.5) that for a graph with 
n nodes, the positions of the nodes can get exponentially close in a rubber band 
representation, which means that we might have to compute with exponentially 
small numbers (in n), which means that we have to compute with linearly many 
digits, which gives an extra factor of n in the running time. One way out is to 
solve the system in a finite field rather than in R. Of course, this ” modular” 
embedding has no physical or geometrical meaning any more, but the algebraic 
structure remains! 

For the analysis, we need the Schwartz-Zippel Lemma |Schwartz 1980 


Zippel 1979): 


Lemma 3.12. Given a polynomial over any field of degree k in m variables, the 
probability that it vanishes for a random substitution, where each variable is chosen 
uniformly from N possible values, is bounded by k/N. 


Let G be a graph and let S C V, |S| =k, and d= k-—1. Let p bea prime and 
Ce € F, for e € E. A modular rubber band representation of G (with respect to S, 
p and c) is defined as an assignment i++ uj; € Fa satisfying 


(3.18) S> cy(us—uj)=0 (Wie V\S). 


JEN (i) 


This is formally the same equation as for real rubber bands, but we work over Fy, 
so no notion of convexity can be used. In particular, we cannot be sure that the 
system has a solution. But things work if the prime is chosen at random. 


Lemma 3.13. Let N > 0 be an integer. Choose uniformly a random prime p < N 
and random weights ce € Fp (e € E). 

(a) With probability at least 1—n?/N, there is a modular rubber band represen- 
tation of G (with respect to S, p and c), such that the vectors u; (i € S) are affine 
independent. This representation is unique up to affine transformations of Fé. 


(b) Let T CV, |T| =k. Then with probability at least 1—n?/N, the represen- 
tation Ue in (a) satisfies rkag({u;: i € T}) = «(S,T). 

Proof. The determinant of the system (3.18) is a polynomial of degree at most 
n of the weights c;;. The Schwartz-Zippel Lemma gives (a). The proof of (b) is 
similar. 
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We sum up the complexity of this algorithm without going into fine details. 
In particular, we restrict this short analysis to the case when k < n/2. We 
have to fix an appropriate N; N = n° will do. Then we pick a random prime 
p< N. Since N has O(logn) digits, the selection of p and every arithmetic op- 
eration in F,, can be done in polylogarithmic time, and we are going to ignore 
these factors of logn. We have to generate O(n”) random weights for the edges. 
We have to invert (over F,) the matrix L,+J, this takes O(M(n)) operations, 
where M(n) is the cost of multiplying two nxn matrices (currently known to be 
O(n?-3"27) [Wassilevska Williams 2012]). Then we have to compute Y(L.+J)~} for 
a polylogarithmic number of different matrices Y (using Exercise b) below). 
For each Y, we have to check affine independence for one k-set in the neighbor- 
hood of every node, in O(nM (k)) time. Up to polylogarithmic factors, this takes 
O(M(n)+nM(k)) time. 

Using much more involved numerical methods, solving linear equations involv- 
ing the Laplacian and similar matrices can be done more efficiently. For details, we 


refer to [Spielman 2011] and |Spielman—Teng 2014]. 


Exercise 3.1. Prove that miny €.(u) (where the minimum is taken over all posi- 
tions u with some nodes nailed) is a concave function of the rubber band strengths 
cE R®. 


Exercise 3.2. Let x be the equilibrium position of a rubber band framework 
with the nodes in S nailed, and let y be any other position of the nodes (but with 
the same nodes nailed at the same positions). Then 


E-(y) —Ec(x) = Ec(y — x). 


Exercise 3.3. Let G be a connected graph, 04.5 CV,andu: S > R®. Extend 
ttou: V\S > R% as follows: starting a random walk at j, let i be the (random) 
node where S is first hit, and let u; denote the expectation of the vector W;. Prove 
that u is the same as the rubber band extension of U. 


Exercise 3.4. Let G be a connected graph, and let u be a vector-labeling of an 
induced subgraph H of G (in any dimension). If (H, u) is section-connected, then 
its rubber-band extension to G is section-connected as well. 


Exercise 3.5. Let u be a rubber band representation of a planar map G in 
the plane with the nodes of a country T nailed to a convex polygon. Define 
Fi; = u;—uj for all edges in E\ E(T). (a) If T is a triangle, then we can define 
Fi; € R? for ij € E(T) so that Fi; = —Fj:, Fi; is parallel to uj—uj, and 
Dien) Fis = 0 for every node i. (b) Show by an example that (a) does not 
remain true if we drop the condition that T is a triangle. 


Exercise 3.6. Prove that every Schlegel diagram with respect to a face F' can 
be obtained as a rubber band representation of the skeleton with the vertices of 
F nailed (the strengths of the rubber bands must be chosen appropriately). 


Exercise 3.7. Let G be a 3-connected simple planar graph with a triangular 
country p = abc. Let q,r,s be the countries adjacent to p. Let G* be the dual 
graph. Consider a rubber band representation u: V — R? of G with a, b,c nailed 
down (both with unit rubber band strengths). Prove that the segments represent- 
ing the edges can be translated so that they form a rubber band representation 
of G* —p with q,r,s nailed down (Figure[3.11p. 
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FIGURE 3.11. Rubber band representation of a dodecahedron with 
one node deleted, and of an icosahedron with the edges of a triangle 
deleted. Corresponding edges are parallel and have the same 
length. 


Exercise 3.8. A convex representation of a graph G (in dimension d, with bound- 
ary S CV) is a mapping of V > R?@ such that every node in V \ S is in the relative 
interior of the convex hull of its neighbors. (a) The rubber band representation 
extending any map from S C V to R® is convex with boundary S. (b) Not every 
convex representation is constructible this way. 


Exercise 3.9. In a rubber band representation, increase the strength of an edge 
between two unnailed nodes (while leaving the other edges invariant). Prove that 
the length of this edge decreases. 


Exercise 3.10. Prove that a 1-dimensional rubber band representation of a 2- 
connected graph, with boundary nodes s and t, nondegenerate in the sense that 
the nodes are all different, is an s-t-numbering (as defined in the Introduction). 
Show that instead of the 2-connectivity of G, it suffices to assume that deleting 
any node, the rest is either connected or has two components, one containing s 
and one containing t. 


Exercise 3.11. Let G be any graph, and let H be a k-connected graph with 
V(H) = V. Then G is k-connected if and only if u and v are connected by k 
openly disjoint paths in G for every edge uv € E(H). 


Exercise 3.12. Let G be any graph and k < n. (a) For ¢ pairs of nodes chosen 
randomly and independently, we test whether they are connected by k openly 
disjoint paths. Prove that if G is not k-connected, then this test discovers this 
with probability at least 1—exp(—2(n—k)t/n?). (b) For r nodes chosen randomly 
and independently, we test whether they are connected by k openly disjoint paths 
to every other node of the graph. Prove that if G is not k-connected, then this 
test discovers this with probability at least 1— ((k—1)/n)’. 


Exercise 3.13. Let G be a graph, and let Z be a matrix obtained from its 
Laplacian Lg by replacing its nonzero entries by algebraically independent tran- 
scendentals. For S,T’ C V, |S| = |T| =k, let Zs,r denote the matrix obtained 
from Z by deleting the rows corresponding to S and the columns corresponding 
to T. Then det(Zs,r) 4 0 if and only if K($,T) = k. 


CHAPTER 4 


Discrete Harmonic Functions 


Having discussed the nice geometry of rubber band embeddings, let us have a 
closer look at the one-dimensional case (or, if you prefer, at one of the coordinate 
functions). This will allow us to make some arguments from the previous chapter 
(“potential argument” and “inverting the Laplacian”) more general and precisely 
stated. 

Discrete harmonic functions are discrete analogues of harmonic functions in 
analysis, where they play a fundamental role. A “true” harmonic function can be 
defined as a twice differentiable function f : R? — R (perhaps defined only in some 
domain) satisfying the differential equation Af = 0, where A = 7“_, 02/02? is 
the Laplace operator. It is a basic fact that such functions can be characterized by 
the “mean value property”: their value at any point equals to their average on any 
ball around this point. We take this second characterization as our starting point 
for introducing an analogous notion of harmonic functions defined on the nodes of 
an arbitrary graph. 

Discrete harmonic functions play an important role in the study of random 
walks (after all, the averaging in the definition can be interpreted as expectation 
after one move). They also come up in the theory of electrical networks, and also 
in statics. This provides a connection between these fields, which can be exploited. 
In particular, various methods and results from the theory of electricity and statics, 
often motivated by physics, can be applied to provide results about random walks, 
and vice versa. From our point of view, their main applications will be in the theory 
of discrete analytic functions in later chapters. 


4.1. Basics 


4.1.1. Definition, uniqueness and existence. Let G be a connected simple 
graph. A function h: V — R is called harmonic at a node v € V if 


1 
dean) oe h(u) = h(v) 


(the value of h at a node v is the average of its values at the neighbors of v.) We 
can also write this condition as 


(4.2) S$ (h(v)—A(u)) = 0. 


ue N(v) 


(4.1) 


A node where a function is not harmonic is called a pole of the function. Figure[4.]] 
shows a couple of very simple functions harmonic at many nodes, with black nodes 
denoting their poles. 

If we consider h as a vector indexed by the nodes, and Lh is the vector obtained 
by applying the Laplacian L = Lg to h, then we can express as (Lh), = 0. 
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FIGURE 4.1. (a) A harmonic function on a cycle, with two poles 
(black nodes). (b) A harmonic function on a triangular grid, with 
poles at the center and on the boundary. 


It is useful to note that, since each column sum of L is zero, 
(4.3) S > (Lh)y = 1"Lh = 0. 
vEV 


So every function is harmonic “on the average”. 
We can extend this notion to multigraphs by replacing (4.1) by 


1 
(4.4) S> Alu) = h(v). 
deg(v) ese 
Another way of writing this is 
(4.5) S> au» (h(v) — h(u)) = 0, 
ueV 


where dy, is the multiplicity of the edge uv. We could further generalize this by 
assigning arbitrary positive “strengths” 6,, to the edges of G, and say that h is 
harmonic at v (on the weighted graph (G, 3)) if 


(4.6) Y= Buv(h(v) —h(u)) = 0. 
ueV 
In what follows, we will restrict our arguments to simple graphs (to keep things 
simple), but all of the results below could be extended to multigraphs and weighted 
graphs in a straightforward way. 
Every constant function is harmonic at each node. On the other hand, it was 


observed very early [|Katona—Korvin 1968} that this is all: 


Proposition 4.1. Every nonconstant function on the nodes of a connected graph 
has at least two poles. 


This implies that ifh: V — R is a nonconstant function, then Lh 4 0, so the 
nullspace of L is one-dimensional: it consists of the constant functions only. 


Proof. Let S be the set where the function assumes its maximum, and let S$’ 
be the set of those nodes in S' that are connected to any node outside S. Then 
every node in S’ must be a pole, since in (4-1), every value h(u) on the left had 
side is at most h(v), and at least one is less, so the average is less than h(v). Since 
the function is nonconstant, S is a nonempty proper subset of V, and since the 
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graph is connected, $’ is nonempty. So there is a pole where the function attains 
its maximum. Similarly, there is another pole where it attains its minimum. 


The proof above shows that every non-constant function on the nodes of a 
graph assumes its maximum as well as its minimum on one of its poles. This can 
be viewed as a (very simple) version of the Mazimum Principle familiar from the 
theories of harmonic and analytic functions. 

For any two nodes there is a nonconstant harmonic function that is harmonic 
everywhere else. More generally, we have the following theorem. 


Theorem 4.2. For a connected simple graph G, nonempty set S C V and function 
ho: S—R, there is a unique functionh: V —+R extending ho that is harmonic 
at each node of V\ S. 


We call this function h the harmonic extension of ho. Note that if |S| = 1, then 
the harmonic extension is a constant function (and so it is also harmonic at S, and it 
does not contradict Proposition|4.1). If S = {a,b}, then a function that is harmonic 
outside S$ is uniquely determined by two of its values, h(a) and h(b). Scaling the 
function by a real number and adding a constant preserves harmonicity at each 
node, so if we know how to construct the harmonic extension h with (say) h(a) = 
0 and h(b) = 1, then g(v) = go(a)+(g0(b)—go(a))h(v) describes the harmonic 
extension of the function go: {a,b} — R. In this case equation is equivalent 
to saying that F;; = h(j) —h(i) defines a flow from a to b. 


Proof. The uniqueness of the harmonic extension is easy by the argument in 
the proof of Proposition [4.1] Suppose that h and h’ are two harmonic extensions of 
ho. Then g = h—h’ is harmonic on V\S, and satisfies g(v) = 0 at each v € S. If 
g is the identically 0 function, then h = h’ as claimed. Else, either its minimum or 
its maximum is different from 0. But we have seen that both the minimizers and 
the maximizers contain at least one pole, which is a contradiction. 

The existence is just the 1-dimensional case of the construction of the rubber 
band extension in Section 


There are several other ways of proving the existence; a few of these could be 
based on the models below and on the linear algebra considerations in the next 
sections. 


4.1.2. Random walks. Random walks were considered in the Introduction, 
but we need a few more definitions. Recall that a random walk on a graph G is a 
random sequence (v°,v!,...) of nodes constructed as follows: We pick a starting 
point v° from a specified initial distribution o (often a specific node), we select a 
neighbor v! of it at random (each neighbor is selected with the same probability 
1/deg(v°)), then we select a neighbor v? of this node v! at random, etc. We denote 
by o* the distribution of v*. 

For any event A expressible in terms of the random walk (v°,v',...), started 
from the distribution 7, we denote by P,(A) its probability. For example, P,,(v’ = 
u) is the probability that after t steps, the random walk started at u returns to the 
starting node. Expectations E,(X) are defined similarly for random variables X 
depending on the walk. 

In the language of probability theory, a random walk is a finite time-reversible 
Markov chain. There is not much difference between the theory of random walks 
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on directed graphs and the theory of finite Markov chains; every Markov chain can 
be viewed as random walk on a directed graph, if we allow weighted edges, and 
every time-reversible Markov chain can be viewed as a random walk on an edge- 
weighted undirected graph. The theory of random walks (finite Markov chains) has 
a large literature, and we will not be able to provide here even an introduction. 
The following discussion, based on well known and intuitive facts, can be followed 
even by those not familiar with the subject. For example, we need the fact that, 
with probability 1, a random walk on a connected graph hits every node infinitely 
often. 

Let a denote the probability distribution in which the probability of a node 
is proportional to its degree: 7, = deg(v)/(2m). This distribution is called the 
stationary distribution of the random walk. It is easy to check that if v° is selected 
from 7, then after any number of steps, v* will have the same distribution 7. (This 
explains the name “stationary distribution” ). 

Using the adjacency matrix A = Ag and the diagonal matrix D = Dg com- 
posed of the degrees, we define the transition matriz of the random walk on G as 
P=D"'A=I-D“'L. If we are at node i, then the probability of moving to 
node j is Pj;. The eigenvalues of P are the same as the eigenvalues of the symmet- 
ric matrix [—L = D'/?PD-/2, so in particular they are real. (This shows why 
the spectrum of the normalized Laplacian plays an important role in the study of 
random walks.) The largest eigenvalue of the transition matrix P is 1, and it has 
multiplicity 1 for connected graphs. It is straightforward to check that the right 
eigenvector belonging to it is 1, and the left eigenvector is given by 7. 

Let us return to harmonic functions. Given a nonempty subset S C V anda 
function ho : S'— R, we define 


(4.7) h(v) E,,(ho(av)), 


where a, is the (random) node where a random walk first hits S (this node exists 
with probability 1). Clearly h(v) = ho(v) for v € S. Furthermore, for a node 
v €V\S we have 


h(v) = Ey(ho(av)) = S> Py(w' = u)Eu(ho(au)) = : S> Alu). 


uEN(v) 
This shows that h is harmonic on V \ S. 
As a special case, if S = {s,t}, ho(s) = 0 and ho(t) = 1, then h(v) is the 
probability that a random walk starting at v visits t before s. 


Remark 4.3. Random walks on infinite graphs have quite different problems. 
Consider an infinite connected graph G with finite degrees, and specify a “root” 
node r. For each vu € V, let h(v) denote the probability that starting a random 
walk at v, we hit r at some time. Clearly h(r) = 1, and h is harmonic at every 
node other than r. 

If the function h is identically 1, then the random walk is recurrent. In this 
case, it almost surely visits every node infinitely often. If it is not recurrent, then it 
is called transient; it almost surely visits every node only a finite number of times. 
(These properties seem to depend on the root, but in fact if the graph is recurrent 
for one root, then it is recurrent for every other root.) 

For a recurrent walk, the function A is a trivial harmonic function. For a 
transient walk, we get an interesting function with a single pole. We can of course 
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restrict the function to a finite induced subgraph, but then it will have more poles, 
at all neighbors of the complementary node set. 

Basic results in probability tell us that the grid graph on Z? (in which two 
nodes are connected whenever their distance is 1) is recurrent for dimensions d < 2, 
but transient for d > 2 [Pélya 1921]. Random walks on infinite graphs is a very 
important topic, connected to our theme in many ways, but this topic does not fit 
in this book. 


4.1.3. Electrical networks. In this third construction for harmonic func- 
tions, we use facts from the physics of electrical networks like Ohm’s Law and 
Kirchhoft’s two laws. This looks like stepping out of the realm of mathematics; 
however, these results are all purely mathematical. 

To be more precise, we consider these laws as axioms. Given a connected 
network whose edges are conductors, and an assignment of potentials (voltages) to 
a subset S' of its nodes, we want to find an assignment of voltage differences Uj; 
and currents J;; to the edges ij so that Ui; = —Uji, [iy = —Ljs (so the values are 
defined if we give the edge an orientation, and the values are antisymmetric), and 
the following laws are satisfied: 

Ohm’s Law: Uj; = Rij li; (where Rj; > 0 is the resistance of edge e; this does 
not depend on the orientation). 

Kirchhoff’s Current Law: Suen Iu, = 0 for every node vu ¢ S; 


Kirchhoff’s Voltage Law: je R(C) Uj; = 0 for every cycle CG (the order of 
i and 7 is given by fixing a direction of traversing C). 

Note that every cycle has two orientations, but because of antisymmetry, these 
two oriented cycles give the same condition. Therefore, in a bit sloppy way, we can 
talk about Kirchhoff’s Voltage Law for an undirected cycle. As a degenerate case, 
we could require the Voltage Law for oriented cycles of length 2, consisting of an 
edge and its reverse, in which case the law says that U;;-+U;; = 0, which is just 
another way of saying that U is antisymmetric. 

The following reformulation of Kirchhoff’s Voltage Law will be useful for us: 
there exist potentials p(i) € R (i € V) so that Ui; = p(j)—p(t). This sounds like a 
statement in physics, but in fact it is a rather simple fact about graphs: 


Proposition 4.4. Given a connected graph G and an antisymmetric assignment 
U;; of real numbers to the oriented edges, there exist real numbers p(i) (i € V) so 
that Uiz = p(j) — p(t) if and only tf oi;em(cy Vij = 0 for every cycle C. 


Proof. It is trivial that if a “potential” exists for a given assignment of voltages 
U,;, then Kirchhoff’s Voltage Law is satisfied. Conversely, suppose that Kirchhoff’s 
Voltage Law is satisfied. We derive a couple of consequences. 


Claim 1. If W is any closed walk on the graph, then ie R(W) U;; = 0. 


This follows by induction on the length of the walk. Consider the first time 
when we encounter a node already visited. If this is the end of the walk, then the 
walk is a single oriented cycle, and the assertion follows by Kirchhoff’s Voltage Law. 
Otherwise, this node splits the walk into two closed walks, and since the sum of 
voltages is zero for each of them by induction, it is zero for the whole walk. 


Claim 2. If W, and W2 are open walks with the same starting nodes and the same 
ending nodes, then igeE(W1) Ui; = ijeE(W2) Ui;- 
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Indeed, concatenating W; with the reversal of W2 we get a closed walk whose 
total voltage is ijeR(Wr) Ui; — ijeR(We) U;;, and this is 0 by Claim 1. 

Now let us turn to the construction of the potentials. Fix any node v as a root, 
and assign to it an arbitrary potential p(v). For every other node u, consider any 
walk W from v to u, and define p(u) = Y)i;en(w) Vij. This value is independent 
of the choice of W by Claim 2. Furthermore, for any neighbor wu’ of u, considering 
the walk W+uu’, we see that p(u’) = p(u)+Uuuw, which shows that p is indeed a 
potential for the voltages U. 


To check Kirchhoff’s Voltage Law for every cycle in the graph is a lot of work; 
but these equations are not independent, and it is enough to check it for a basis of 
these equations. There are several useful ways to select a basis; here we state one 
that is particularly useful, and another one as Exercise [4.1] 


Corollary 4.5. Let G be a 2-connected planar graph and let Uj; be an antisymmet- 
ric assignment of real numbers to the oriented edges. There exist real numbers p(t) 
(i € V) so that U;; = p(j) — p(t) if and only if Kirchhoff’s Voltage Law is satisfied 
by the boundary cycle of every bounded country. 


Proof. We want to show that Kirchhoff’s Voltage Law is satisfied by every cy- 
cle C. Fix any embedding of the graph in the plane, and (say) the counterclockwise 
orientation of C. Let us sum Kirchhoff’s Voltage Law corresponding to countries 
contained inside C’, with the counterclockwise orientation of each; the terms cor- 
responding to internal edges cancel (by antisymmetry), and we get Kirchhoff’s 
Voltage Law for C. 


We have given these simple arguments in detail, because essentially the same 
argument is used repeatedly in the book (already used in Section [3.2), and we 
are not going to describe it every time; we will just refer to it as the “potential 
argument”. We could formulate and prove Proposition [4.4] and Corollary [4.5] for 
an arbitrary abelian group instead of the real numbers. The group of complex 
numbers, and the group of vectors in a linear space, will be used several times. 

The main result from the theory of electrical networks we are going to use is that 
given a connected graph with voltages assigned to a nonempty set of nodes, there 
is always a unique assignment of potentials to the remaining nodes and currents to 
the edges satisfying Ohm’s and Kirchhoff’s Laws. 

Returning to the proof of T heorem [4.2| consider the graph G as an electrical 
network, where each edge represents a conductor with unit resistance. Given a set 
S CV and a function hy : S > R, let a current flow through the network while 
keeping each v € S at potential h9(v). Then the potential h(v) defined for all nodes 
v is an extension of ho, and it is harmonic at every node v € V\S. Indeed, the 
current through an edge uv is h(u) —h(v) by Ohm’s Law, and hence by Kirchhoff’s 
Current Law, > jen (i) (h(j) —h(i)) = 0 for every node i. 

As a special case, if we send unit current from node s to node t, then the 
potentials h of the nodes satisfy the equation Dh = 1;,—1,. 


4.1.4. Rubber bands. We have seen that harmonic functions arise as 1- 
dimensional special cases of rubber band extensions. Let us discuss a further prop- 
erty for this model. 
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MS 


FIGURE 4.2. Stretching two nodes of a graph with edges replaced 
by rubber bands. The figure should be one-dimensional, but it is 
vertically distorted so that the graph can be seen. 


Consider the case of stretching just two nodes s and t to a unit distance apart 
(Figure |4.2). We see that they pull our hands with a force of 


(4.8) F(s,t)= $0 (h(u)—A(s)) = $0 (A(é)—A(u)) 


ue N(s) uEN(t) 


(by basic physics, both of our hands feel the same force, in opposite directions). 
Considering the energy balance gives further important information. Imagine that 
we slowly stretch the graph until nodes s and t will be at distance 1. When they 
are at distance x, the force pulling our hands is «F(s,t) (everything scales by a 
factor of 2), and hence the energy we have to spend is 


1 


aco dx = 5F (st). 
0 


This energy accumulates in the rubber bands. By a similar argument, the energy 
stored in the rubber band wv is (h(v) —h(u))*/2. By conservation of energy, we 
get the equation 


2 
(4.9) S> (h(v)—h(u))” = F(s, t). 

UuveEe 
(It is easy to prove identities (4.8) and (4.9) for harmonic functions without any 
reference to physics.) 


4.1.5. Relating different models. A consequence of the uniqueness prop- 
erty in Theorem [4.2] is that the harmonic functions constructed in sections 


[4.1.3] and are the same. As an application of this idea, we show an interesting 
identity [Chandra et_al. 1989]. 

Considering the graph G as an electrical network, where every edge has resis- 
tance 1, we define the effective resistance R(s,t) between nodes s and t by sending 
a unit current from s to t, and measuring the voltage between s and t. Considering 
the graph G as a rubber band structure in equilibrium, with two nodes s and t 
nailed down at 1 and 0, let F(s,t) denote the force pulling the nails (cf. (4-10)). 
Doing a random walk on the graph, let comm(s,t) denote the commute time be- 
tween nodes s and t (i.e., the expected time it takes to start at s, walk until you 
first hit t, and then walk until you first hit s again). 


Theorem 4.6. Let G be a connected graph and s,t €E V. The effective resistance 
R(s,t), the force F(s,t), and the commute time comm(s,t) are related by the equa- 
tions 


1 _ comm(s, t) 


ne F(s,t) 2m 
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Proof. Consider the function h € RY satisfying h(s) = 0, h(t) = 1 and har- 
monic for v # s,t (we know that this exists and it is uniquely determined). By 
(4.8), we have F'(s,t) = do jens) h(J). 

By the discussion in Section |4.1.3} h(v) is equal to the potential of v if we fix 
the potential of s and t as above. By Ohm’s Law, the current through an edge sj 
is h(j)—h(s) = A(j). Hence the current through the network is }) jen.) h(J) = 
F'(s,t), and the effective resistance is R(s,t) = 1/F(s,t). 

The assertion about commute time is a bit more difficult to prove. We know 
that h(w) is the probability that a random walk starting at u visits b before a. 
Hence p = (1/deg(s)) Duen(s) Mu) = F(s,t)/deg(s) is the probability that a 
random walk starting at s hits t before returning to s. 

Let T be the first time when a random walk starting at s returns to s and 
let S be the first time when it returns to s after having visited t. We know from 
the theory of random walks that E(T) = 2m/deg(s) and by definition, E(S) = 
comm(s,t). Clearly T < S and T = S means that the random walk starting at s 
hits t before returning to s. Thus P(T = S) = p. If T < 8S, then after the first 
T steps, we have to walk from s until we reach ¢ and then return to s. Hence 
E(S—T | T < S) = E(S), and so 


E(S—T) =pE(S—T | T=S)+(1—-p) E(S—T | T <S) =(1-p)E(S). 
This gives 


2m a _ _ 7 = = F(s,t) 
Tega ~ ECT) = ES) -E(S—T) = pE(S) = TS 


Simplifying, we get the identity in the theorem. 


comm (s, t). 


As a further application of this equivalence between our three models, we de- 
scribe what can be said if an edge is deleted. 


Theorem 4.7. Let G be a connected graph, and let us connect two nodes i and j 
by a new edge. Then for any two nodes s,t EV, 


(a) The ratio comm(s,t)/m does not increase. 
(b) (Raleigh Monotonicity) The effective resistance R(s,t) does not increase. 


(c) If nodes s and t are nailed down at 0 and 1, the force F(s,t) does not 
decrease. 


Proof. The three statements are equivalent by Theorem [4.6} we prove (c). By 
(4.9), it suffices to prove that the equilibrium energy does not decrease. Consider the 
equilibrium position of G’, and delete the new edge. The contribution of this edge 
to the energy was nonnegative, so the total energy decreases. The current position 
of the nodes may not be in equilibrium; but the equilibrium position minimizes the 
energy, so when they move to the equilibrium of G’, the energy further decreases. 


4.2. Estimating harmonic functions 


4.2.1. Harmonic flows. Suppose (for simplicity) that S = {s,t}, and let 
h: V >R be a function with h(s) = 0, h(t) = 1, and harmonic outside S. Then 
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(4.5) tells us that assigning f(e) = h(v)—h(u) to every edge e = uv, we get a flow 
from s to t, which we call a harmonic flow. The value of this flow is 


(4.10) F(s,t)= S> f(sv)= So f(vt). 


vEN(s) vEN(t) 


It will be convenient to normalize differently and consider the flow f = f /F(s,t); 
this has value 1. 

An s-t flow that is, at the same time, a potential, will be called a harmonic 
flow. The potential constructed from a harmonic function with poles s and t is 
a harmonic flow, and vice versa. The dimension of the linear space of s-t flows 
is m—n+2, and the dimension of the linear space of potentials is n—1. Since 
(m—n+2)+(n—1) > m, these two spaces have a nonzero intersection. This gives 
yet another proof that harmonic functions with poles s and ¢ exist. 

But considering functions on the edges, not only on the nodes, gives more. 


Theorem 4.8. Let G be a connected graph and s,t,e V. Then 
(4.11) R(s,t) = min S~ g(e)’, 
ecH 


where g: E — R ranges over all functions satisfying occ gle) = 1 for every s-t 
cut C. Furthermore, 


(4.12) F(s,t) = min S~ g(e)’, 


where g: E —R ranges over all functions satisfying ee K(P) g(e) > 1 for every 
s-t path P. In both optimization problems, the optimizing function g is a harmonic 
s-t flow. 


We can restate these formulas as follows. For g: E — R4, let cg(s,t) be the 
minimum capacity of an s-t cut (viewing g as a capacity function on the edges), and 
let dg(s,t) be the minimum length of an s-t path (viewing g as a length function 
on the edges). Then 


ee ge Igl? 

‘ t)= ——_ as F(s,t) = == 
on He eae oo aoe 
We can also write this as 

d,(s, t)? t 
(4.14) R(s,t)= max are and F(s,t)= max cals, t) 
gESR,  |g| g: ER, |g 


Proof. We prove the first identity; the second one can be proved similarly, or 
by referring to a general duality result (Proposition[C.2]in the Appendix). Let g be 
an optimizer in (4.11). We may assume that g > 0 (else, we can replace it by |g|). 
We replace each undirected edge e by two, oppositely directed edges e’ and e”, and 
consider g(e) = g(e’) = g(e”) as the capacity of each of them. The condition on 
g says that every s-t cut has total capacity at least 1. By the Max-Flow-Min-Cut 
Theorem, the graph carries an s-t flow f of value 1 with 0 < f < g. We may 
assume that min{ f(e’), f(e”)} = 0. Replacing g with g’(e) = max{ f(e’), f(e”)} we 
get a function satisfying the conditions of the theorem, and decrease the objective 
function, unless g’ = g. If we delete one of e’ and e” for which f = 0, we get an 
orientation of G on which g defines a flow of value 1. 
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So g minimizes 5>, g(e)? over all s-t flows of value 1. Such flows form an affine 
subspace A C R”. The minimizer is the closest point of A to the origin; hence it is 
orthogonal to A. Shifting A to the origin we get the space of all circulations; the 
orthogonal complement of this space is the space Ao of all potentials. So g is both 
an s-t flow of value 1 and a potential, i.e., a harmonic flow. 

We can write g(uwv) = h(v)—h(u) for every edge uv, and h is a harmonic 
function with poles at s and t. By the electrical networks model, we have R(s,t) = 
h(t)—h(s). Switching to the rubber band model, )>, g(e)? = >>, (h(v)— h(u))” is 
twice the energy stored in the rubber bands, when s and t are stretched to a distance 
of R(s,t). By our discussion of the rubber band model, this is F(s,t)R(s,t)? = 
R(s,t). 


4.2.2. Combinatorial bounds. Choosing appropriate functions g in (4.13) 
or (4.14), we can prove useful combinatorial bounds. Let us start with an important 
example. 


Example 4.9. Consider the grid formed by the lattice points (i, 7) with |¢|, |j| < k. 
Identify the boundary to a single node t, and let s = (0,0). We show how to use 
the inequalities in the previous section to prove 


1 
(4.15) R(s,t) ~ Z Ink. 
To this end, let us define 
1 
= EB 
ge)=—5 (ee), 


where r is the distance of e from s in the grid graph. (We had to add 1 so that 
the edges incident with s get finite weight.) This value of g(e) is not the harmonic 
flow from s to t, which would be more complicated to express. To motivate, notice 
that (in the whole grid) there are 4r+4 edges at distance r from s, forming a cut 
C, of total weight is 4, independently of r. So from the point of view of these cuts, 
g “looks like” and s-t-flow of value 4. 


We need to do some computations. First, EF contains k of the cuts Co,...,Ck_-1, 
and is covered by the cuts Co,...,C2~-1, and hence 
k-1 l ky 
2> 4r+4 =4 — =4H 
a2 rt Dre 425 = Be 
r=0 j=l 
and similarly, 
2k 4 
Ig? <4) = < 4H, +4. 
J 
q=1 


It is clear that every s-t path contains an edge at distance r from s for every r, and 
the shortest path contains exactly one of each, so 


d,(s, t) = Ax. 
Finally, we have 
(4.16) Cg(s,t) = 4. 


The proof of this is elementary, but takes a bit more work, and is left to the reader 
as Exercise 


4.2. ESTIMATING HARMONIC FUNCTIONS 49 


Now by (4.13), 
7 4H,+4 H,+1 
R(s,t) < lg| 5 < ko -_ kt 
Cq(s, t) 16 4 


and by (4.14), 


d,(s,t)? . sa Hy;,—-1 
lg? — 4H, +4 4 
These estimates prove (4-15). 4 


R(s,t) > 


Let us return to general graphs, and formulate a corollary of Theorem 


Corollary 4.10. Let C),...,C, be edge-disjoint cuts separating s and t. Then 


k 
(4.17) R(s,t) > 2 G 


Proof. Let 

Gi if e € C; for some 7, 
j 

0, otherwise. 

Then 


Ig? = S— gle => gee Sa 


e€E j=l eceC; j=l 
Since every s-t path intersects every cut C;, we have 


and so 


(SEAWGl) 


k 
1 
ROO 2 SEG) 2 


As a further application, we prove a bound on the “smoothness” of a harmonic 
function. 


Corollary 4.11. Let G be a graph, and leth: V > Ry be harmonic outside a set 
SCV. Letu,v € V\S such that uw € E and h(u) < h(v), and let Ch,...,C, be 
edge-disjoint cuts in G\ uu separating u from S’ ={x ES: h(x) < h(u)}. Then 


h(v) —h(u) < (S a) hw: 


Proof. Recalling the rubber band model, we can think of each node x € S' as 
nailed at h(x). Then |h(w)—h(v)| is the length to which uv is stretched. We may 
assume that h(w) < h(v). Let G’ = (V’, E’) be the union of those paths from u 
to a nailed node along which h is monotone decreasing. We may assume that all 
nodes of V\V’ are nailed; in particular, v is nailed. 

We claim that deleting the edges in E\ E’, the length to which wv is stretched 
is increasing. This follows if we show that all nodes of V’ move down. Suppose 
that this is not the case, and let w € V’ be node that moves up the most, say by a 
distance of s. Then all its neighbors in V’ move up by at most s, so they will pull 
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down at least as much as before. In addition, any deleted edge pulled w upwards. 
It follows that equality must hold, so all neighbors of w must move up by the same 
distance s. Since w is on a path from u to a nailed node z, this implies that 7 must 
move up by s, which is impossible as zx is nailed. 

By a similar argument, we may assume that h(a) = 0 for all nodes in S’. It 
follows by that the force with which G’ pulls u down is 


1 a 
Fa(s,u) = Ra(s,u) < (~ a) h(w) 


i=l 


(note: G’ is stretched to a length of h(u)). This is counteracted by the force on the 
edge uv, which is h(v) —h(w). 


We conclude with a special case showing that the bound in the previous example 
is nontrivial. We return to the planar grid, but more generally than before we 
consider a simple closed lattice polygon P, and the subgraph G of the grid formed 
by all edges and nodes on or inside P. Let ho : V(P) — [0,1] be given, and let 
h be the harmonic extension of ho to the interior of P. The following Corollary 
shows that far from the boundary, h is “smooth”. (Such a result does not remain 
true in dimension 3, see Exercise |4.11}) 


Corollary 4.12. Let uv € E, and let t be the graph distance of uv from P. Then 


(wu) -h(w)| < 

Proof. Let C; be the cut formed by all edges with one endpoint at Manhattan 
distance j and the other at Manhattan distance 7—1 from u. (The Manhattan 
distance is the graph distance in the whole grid graph, which may be smaller than 
the graph distance in G.) Then |C;| < 67, and the cuts C),...,C; separate u from 
P. So Corollary [4.11] implies that 


h(v) —h(u) < (S a) < ( 


1 )~ 2 6 
jal 67 Int 


4.3. Inverting the Laplacian 


In the proof of the existence of harmonic functions with given poles, we had 
to invert the Laplacian—in a sense. It will be useful to discuss the solution of an 
equation 


(4.18) Lx =a. 


First, for this to be solvable, we must have 1'a = 0; indeed, 1'a = 1'Lax = 0. 
Second, we may assume that 1'xr = 0; indeed, if x is a solution, then x+a1 is a 
solution for every real number a, and from this family of solutions, it suffices to 
find the one with 1'z = 0. Now if 1’ = 1'a = 0, or equivalently Jz = Ja = 0, 
then 


(4.19) (L+J) a= (L4+J) "Le = (14+ J) (1+ J)c=2, 


thus giving an explicit solution of (4.18). 
As an important special case of this simple computation, let G be an electrical 
network with unit resistances on the edges, and let us send a unit current from 
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node s to node t. Then the potentials x of the nodes will satisfy the equation 
Lx =e,—e;, and hence x = (L+ J)~'(e,—e;). The effective resistance is 


(4.20) R(s,t) = 2,—a; = (es—e;)'(L+ J) "(es—e&). 


We describe two (related) algebraic methods to produce certain “inverses” of 
the Laplacian. 


4.3.1. Poisson’s formula and Green’s function. We can rewrite formula 
as follows. Let a,,s(u) denote the probability that the first point in S of 
a random walk starting at v is wu (where S C V is a nonempty set). We call 
Qy,g: S + Ry, the hitting distribution (from v on S). Using this distribution, the 
harmonic extension of a function ho on S can be expressed as 


(4.21) h(v) = © av,s(u)ho(u). 
ues 
This equation can be regarded as a discrete version of Poisson’s Formula. 
Set T = V\S. To keep our notation a bit simpler, we write (in this section) 
fr for the restriction f|r of a function f to a set T. We are looking for a function 
h: V —>R such that 


(4.22) (Lh)r = 0, and hg = ho. 
Let us partition the Laplacian of G as 
Lr Lrg 
4.23 L= ; 
ve) Ge Ls 
(where the first block of rows and columns corresponds to nodes in T; note that 
Lgr = —Agr). The matrix Lr will play an important role, and we need two simple 


properties of it: Dy is invertible and its inverse is nonnegative. Both properties 
mentioned can be proved in (slightly) larger generality, and we will need them 
elsewhere in this more general form. 


Lemma 4.13. Let M € RY*” be a positive semidefinite matrix whose nullspace 
is one-dimensional and it is spanned by a vector with nonzero coordinates. Then 
every proper principal minor of M is positive definite. 


Proof. Let Mr denote the principal minor of M formed by the rows and 
columns indexed by elements of T C V. The matrix M7 is trivially positive semi- 
definite. If Mpa = 0 for some 0 4 x € R’, then for the vector y = (5) € RY, we 
have y'My = «'Mra = 0. Since M is positive semidefinite, this shows that y is in 
the nullspace of M. But y cannot be parallel to the vector spanning the nullspace 
of M, a contradiction. 


Lemma 4.14. Let M be a positive definite matrix whose off-diagonal entries are 
nonpositive. Then M-' > 0. 


Proof. We may assume that the largest eigenvalue of M is 1. Then its diagonal 
entries are at most 1, and hence B = I; — M is positive semidefinite and all entries 
of it are nonnegative. Since M is positive definite, all eigenvalues of B are strictly 
less than 1, and hence the series expansion 


M-1=(-B)“ = (1+ B+B? +...) 


is convergent. Since all entries of all terms in the last sum are nonnegative, the 
same holds for M~!. 
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Coming back to solving equation (4.22) for the function h, we see that the 
restriction hg is given, and to get the restriction hr, we only have to solve the 
equation 

Lrhpt+Llrgshs = 0. 
Since Lr is invertible by LemmaJ4.13] this system can be solved for hr, and we get 
that 


(4.24) hp = —L7z'Lrshs 


is the unique harmonic extension of hg. 

The matrix Lin's considered as a function in two variables, is a discrete analogue 
of Green’s function in analysis, and we’ll call it the (discrete) Green function of the 
graph G, on the set T. (To relate this to the Green function in analysis, think of 
T as an open domain and of S, as its boundary. It is easy to see that nodes of S 
that are not adjacent to T play no role in the arguments below.) We denote Lin 
by Hr or by A if T is understood. 

Formula expresses Ar as a linear combination of the values of hg: 


(4.25) hivo)=S> SY) Aywh(u) — (v E T?). 
ueS wETNN(u) 


We have seen such an expression using the hitting probabilities on S. Comparing the 
two, and using the uniqueness of harmonic extensions, we get that the probability 
that a random walk starting at a node v € T first hits the set S at node u is given 
by 
(4.26) dy,s(u) =—-(HLrs)yu= >> How. 

weEN(u)NT 


Generalizing equation (4.25), Green’s function provides an important formula 
to express the values of an arbitrary function in terms of its harmonicity defect on 
T and its values on S: 


Lemma 4.15. For every function f: VR andv eT, 
fe) =O Hw (EN) + YS fw). 
weT ueSON (w) 


We can write the formula more compactly: 


(4.27) fr =H(Lf)r—-HAL rs fs, 

or even less compactly: 

(4.28) fe) = So Mwl(Lf(w)+>> J) Bef (u). 
wet u€S weETNN(u) 


If f is the harmonic extension of fs, then the first term vanishes, and the second 
gives (4-25). 
Proof. From the definition of the partition of L, we have 
(Lf)r =Lrfrt+Lrsfs. 
Applying H from the left, 
A(Lf)r = HLrfr+HLlrsfs = fr+HLrsfs. 


Rearranging, we get (4.27). 
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Let us conclude with a nice special case, raised by Grassberger. Let G be a 
simple graph, and fix a node v € V(G). For two nodes i,7 € N(v), let oj; denote 
the probability that a random walk starting at v and leaving through the edge vi, 
first returns to v through the edge vj. 


Corollary 4.16. For any two distinct nodes i,j € N(v), we have oi; < oi. 
Proof. Let T = V(G)\{v} and M = L7'. We claim that 
(4.29) 0 = M;;. 


Indeed, let us split v into deg(v) nodes 7’ (i € N(v)), to get a graph G’. So 
i’ € E(G’). Let S = {i’ : i © N(v)}. Then o;; can be interpreted as the 
probability that a random walk on G’ starting at 7 hits S at 7’. By (4.26), we have 


O15 = s Miw = Mis, 


weN(j/)AT 
proving (4.29). Since (oj; : j € N(v)) is a probability distribution, we have 
(4.30) S> Mj;=1 (eV), 
JEN(v) 


It also follows that the matrix (0;;);,;ew(v) is symmetric and positive definite. 

Let 7 € N(v), and let us pivot in M on the entry (7,7), and then delete the 
i-th row and column. The resulting matrix is the inverse of the matrix Ly\{y,i}, 
which is a positive definite matrix with nonpositive offdiagonal entries, and hence 
its inverse is nonnegative by Lemma|4.14] This implies that 


Mi Myr = MjiMix 
for all i,k, 7 € N(wv). Note that strict inequality holds for k = j, since M is positive 
definite. Summing over all k € N(v), and using (4.30), we get the Corollary. 


4.3.2. Generalized inverses, random walks and resistance. The Lapla- 
cian LC is a singular matrix, and hence it does not have an inverse. But, as every 
matrix, it has a pseudoinverse. While there are different notions of pseudoinverses, 
in this short section we’ll mean that introduced by E.H. Moore, A. Bjerhammar 
and R. Penrose. The pseudoinverse A~! for an arbitrary matrix A can be defined 
by the conditions 


(4.31) AA A= A, A> Asm = Amt, 
and 
(4.32) AA™', A7'A are symmetric. 


It is not very hard to prove that a matrix A~! with these properties always exists, 
and is uniquely determined (Exercise|4.12). We shall need this construction for the 
Laplacian of a connected graph G only, in which case it is almost trivial: writing 


n 

rT 

L= ) MURURUE, 
k=2 


where fly = 0 < pig < +++ < py are the eigenvalues of the Laplacian, and vj,...,Un 
are the corresponding eigenvectors, the matrix 


Lt= S- 2 eat 
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satisfies the conditions of the pseudoinverse, which is straightforward to check. It 
is also easy to compute that 


1 
4.33 LOL=LL71=I]-vv! =I-—J. 
‘ nr 


To compute the pseudoinverse of L, we don’t have to find the spectral decom- 
position of L. It suffices to use again that D+ J is an invertible matrix with spectral 
decomposition 


n 
T T 
L+J = nvyv;, + y LEVEVES 
k=2 
and its inverse is 


1 ate 1 
L 1 5) F T apy = 7 Lo}. 
(L+J) mata 7243, wait 
Hence 
1 
i -1 
(4.34) L-* =(L4+J) mer 


(We could use in these arguments a matrix L+cJ with any nonzero c instead of 
L+J; it would lead to similar formulas.) 

We can go a little further: since LD is positive semidefinite, we can raise its 
nonzero eigenvalues to any real power, and define 


n 
t t T 
[= 5 HEURUE 
k=2 


for t € R. This way we get a family of symmetric matrices, so that L1 = L, L' isa 
continuous function of t, and L't* = LtL* for s,t € R. These matrices can be used 
to construct an interesting geometric representation, which encodes the effective 
resistances in the graph (cf. also equation above): 


Proposition 4.17. Let G be a connected graph, with a unit resistance on each 
edge. Let u; € RY (i € V) be the columns of L~\/?. Then 


R(i,j) = (Las + (L7") jg — (2 ig = [wy — ul? 
for any two nodes 1 and j. 


Proof. Let us push a unit current from i to j7, by keeping 7 and 7 at appropriate 
voltages. Let x, be the voltage of node k. Only the differences of these voltages 
matter, so we may assume that lx = 0. Then the Kirchhoff node equations say 
that La = e; —e;, and using (4.33), we get that 


r= L~*(e;—€;). 
So 
R(i, j) = vj — a; = (ej; -—&;)"L“" (ej —e,) LG + L5 -2L;;, 
which we can also write as 


R(i, j) = |E-"/?(ey —e1) ? = [uy — uy)? 


These constructions work for edge-weighted Laplacians as well. 
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Remark 4.18. For invertible matrices, their adjoint is a scalar multiple of their 
inverse. The adjoint can be defined for singular matrices; is the generalized inverse 
of the Laplacian related to its adjoint? Not really. Since LZ has rank n—1, the 
adjoint is nonzero. However, from the equation LL*4) = det(L)I = 0 we see 
that every column and (by symmetry) every row of L*4 is a multiple of 1, and so 
L**i = cJ. The only interesting information contained in the adjoint is the constant 
c, which is the number of spanning trees in the graph (Exercise |4.6). 


Exercise 4.1. Let G be a connected graph and T, a spanning tree of G. For 
each edge e € E'\ E(T), let C. denote the cycle of G consisting of e and the path 
in T connecting the endpoints of e. Prove that if Kirchhoff’s Voltage Law holds 
for these cycles Cc, then it holds for all cycles. 


Exercise 4.2. Prove that for every connected graph G, the effective resistances 
R(i,j) as well as their square roots satisfy the triangle inequality: for any three 
nodes a, b and c, R(a,c) < R(a,b)+R(b, c), and \/R(a,c) < WR(a, b) + /R(b, c). 


Exercise 4.3. Let G be a connected graph and let u; € RY (i € V) be the i-th 
column of L~'/?. Prove that for any three nodes i,j,k € V, the triangle U;U; UR 
is not obtuse. When does it have a right angle? 


Exercise 4.4. A function f: V — R is subharmonic at a node v € V if 
(4.35) ra S> h(u) > h(v), 

ue N(v) 
Let S C V be a nonempty set, and let f : V — R be subharmonic at every 
node of V\.S. Let v € V\S, and let a, € S be a random node from the hitting 
distribution on S' (starting at v). Prove that 


f(v) < Eo (f(aw)). 
Exercise 4.5. Show by examples that the commute time comm(a, 6) can both 
increase and decrease when we add an edge to the graph. 
Exercise 4.6. Let G be a connected graph, and let 7(G) denote the number of 
spanning trees of G. Let L’ be the matrix obtained by deleting from L a row 
and the corresponding column. (a) Prove that det(L’) = T(G). (b) Prove that 
det(L+J) = n?T(G). (c) Use these facts to prove Cauchy’s Formula: T(Kn) = 


nr? 


Exercise 4.7. Let G’ denote the graph obtained from the graph G by identifying 
nodes a and b. Prove that R(a,b) = 7(G’)/T(G). 

Exercise 4.8. Let G be a graph and e, f € E, e # f. Prove that T(G)T(G\ 
{e, f}) < T(G\e)T(G\ f). 

Exercise 4.9. To every edge e of the standard grid in the plane, let us assign the 
weight g(e) = 1/(r+1), where r is the distance of e from the origin. Let S be a 
set of edges that blocks every infinite path from the origin. Prove that g(S) > 4. 


Exercise 4.10. Let A be an irreducible positive definite matrix whose off-diagonal 
entries are nonpositive. Prove that all entries of A~' are positive. 

Exercise 4.11. Show by an example that the smoothness property stated in 
Corollary does not remain valid for the 3-dimensional grid. 


Exercise 4.12. Prove that if the rows of a matrix A are linearly independent, 
then AA' is nonsingular, and A~' = A'(AA')~? satisfies the conditions for the 
pseudoinverse of A. 


Exercise 4.13. Prove that every matrix has a unique pseudoinverse. 
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Exercise 4.14. Prove that for a connected d-regular graph G, the pseudoinverse 
of ZL can be used to express hitting times by the following formula: 

Hig = dn((L7")ig — (L7" Jas). 
Extend this formula to non-regular graphs, using the pseudoinverse of the scaled 
Laplacian matrix L=D~/?LD~-1/? instead of L~?. 
Exercise 4.15. We have seen that for every graph G one can define a family of 
symmetric matrices L‘ (t € R) so that L' = L, L’ is a continuous function of t, 
and L't* = L'L® for s,t € R. Prove that such a family of symmetric matrices is 
unique. 
Exercise 4.16. Let M be a positive definite n xn matrix whose off-diagonal 


entries are nonpositive. Let N be a submatrix of M~' formed by rows 1,...,k,i 
and columns 1,...,k,7, where k < i,j <n. Prove that det(N) > 0. 


CHAPTER 5 


Coin Representation 


One of the deepest representation results for planar graphs is the “coin represen- 
tation” (representation by touching circles) [Koebe 1936]. This result has numerous 
versions, extensions, and applications. It gives a Steinitz representation by a very 
special polytope (the Cage Theorem), and it has close relations with the Riemann 
Mapping Theorem in complex analysis. 


5.1. Koebe’s theorem 


We say that a family of (circular) disks is nonoverlapping, if their interiors are 
disjoint. Such a family gives rise to a tangency graph: we represent each member 
of the family by a node, and connect two nodes if the corresponding disks are 
touching. It is natural to turn this into a geometric representation: we represent 
each disk by its center, and each edge by the segment connecting the corresponding 
centers. Trivially, this edge passes trough the point of tangency of the two circles, 
and different edges do not intersect except if they have a common endpoint. So we 
get a planar map (Figure [5.1p. Koebe’s Theorem will provide a converse to this 
easy observation. 


FIGURE 5.1. Tangency graph of a set of disks 


The same graph can, of course, be represented by many systems of disks. For 
example, a cycle can be represented by any necklace of pearls—these may be equal, 
or different, and laid out in many different ways (Figure [5.2). 

If the graph is a triangulation, then the representation is “essentially unique” , 
as we will see. But even in this case, we can modify the set of disks: for example, we 
can apply inversion to all of them with respect to a further circle C' (see Appendix 
[C.4.]] for notions and results concerning circles, relevant throughout this chapter). 
If the center of C' is not covered by any of the disks, then this inversion results in 
another representation by a family of disks. If the center of C is in the interior of 
one of the disks (call this D), then we get a similar picture, except that the other 
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iP eS BS 


FIGURE 5.2. Different representations of a 10-cycle by touching disks 


disks will be contained in the interior of the image D’ of D (Figure |5.3). We say 
that we have a coin representation with D turned inside out. We are going to make 
good use of this version of the coin representation in Section [5.3] 


FIGURE 5.3. Representation of a planar graph by touching disks, 
and a representation of the same graph, where the exterior of one 
of the disks plays the role of its “body”. 


After these remarks, we are ready to state the fundamental theorem of Koebe. 


Theorem 5.1 (Koebe’s Theorem). Every planar graph can be represented as 
the tangency graph of a family of nonoverlapping circular disks. 


Such a family of circular disks is called a Koebe representation, or coin rep- 
resentation of the graph. We’ll prove this theorem after formulating a stronger 
version. 

Koebe’s Theorem can be strengthened and generalized in various ways. To 
motivate the extension we are going to discuss next, consider a bounded triangular 
face p = abc of the tangency map G of a family F of non-overlapping circles, 
where a is the center of the circle C, etc. Let D, denote the inscribed circle of 
p, touching the edges of the triangle at points u, v and w (Figure (5.4). Then 
|a—u| = |a—v| (as these segments are tangents to D, from the same point), and 
similarly |b—u| = |b—w| and |c—v| = |c—w]. So the circle C!, about a with radius 
|a—u| = |a—v|, and the circles C; and C/, defined analogously are mutually tangent. 
It is easy to see that this implies that CZ, = Ca, C; = Cy and Cl = C.. The circle 
Dy, intersects each of these circles orthogonally. 

Now consider another bounded triangular country gq = abd of G, sharing the 
ab with abc. The previous argument implies that the inscribed circle Dg of q 
must also touch the edge ab at u, and it is also tangent to D, at this point. If 
all countries are triangular, then the family #* of inscribed circles of bounded 
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FicurE 5.4. Left: A triangle in the tangency graph and its in- 
scribed circle. Right: For two triangles sharing an edge, the in- 
scribe circles touch the edge at the same point. 


countries is well defined, and its tangency graph is the dual of G (with capital of 
the unbounded country omitted). We can also include the circle inscribed in the 
unbounded country, but its “center” should be thought of as the “point at infinity”. 
What happens if not all countries are triangles? A non-triangular country may 
not even have an inscribed circle (touching all edges). The main result below will 
imply that every 3-connected planar graph G has a special representation as the 
tangency graph of a family of circles in which all countries have inscribed circles. 
To be precise, we define a double circle representation of a planar map G as 
two families of circles in the plane, (C; : i € V) and (D,: p € V%*), so that 
for every edge 77, bordering countries p and q, the following holds: the circles 
C; and C; are tangent at a point x,;; the circles D, and Dj, are tangent at the 
same point x,;; and the circles D, and D, intersect the circles C; and C; at this 
point orthogonally. Furthermore, the interiors of the circular discs C; bounded 
by the circles C; are disjoint and so are the disks D;, except that the circle Dy. 
representing the unbounded country contains all the other circles D, in its interior. 
The conditions in the definition of a double circle representation imply other 
useful properties. The proof of these facts is left to the reader as an exercise 


(cf. Figure [5.5). 


Proposition 5.2. Let (Ci): 1¢V; Dp: pe V*) be a double circle representation 
of a 3-connected planar graph G. Then (a) Every country p is bounded by a convex 
polygon, and the circle D, touches every edge of this polygon. (b) For every bounded 
country p, the circles D, and C; (i € V(p)) cover p. (c) If i € V is not incident 
with p € V*, then C; and De are disjoint. 


The main theorem in this chapter asserts that such representations do exist. 


Theorem 5.3. Every 3-connected planar graph has a double circle representation 
in the plane. 


The double circle representation is not unique (we can apply circle-preserving 
transformations to it), but to make it unique, we only need to fix the circles repre- 
senting the nodes of a triangular country: 


Corollary 5.4. Let G be a 3-connected planar graph with a triangular country abc. 
Fixing three mutually tangent openly disjoint disks in the plane to represent a, b 
and c, there is a unique extension to a double circle representation of G with abc 
as the unbounded country. 
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FiGuRE 5.5. Left: a double circle representation K4 and its dual 
(another complete graph on 4 nodes). For the circle representing 
the capital of the unbounded country, the exterior domain should 
be considered as the disk it “bounds”. Right: a piece of a double 
circle representation of a planar graph and its dual. 


In the next sections, we will prove these basic results. 


5.1.1. Angles and radii. Let G be a 3-connected planar map. We transform 
G so that a triangular country po in G becomes the unbounded country; if G has 
no triangular country, then we interchange the roles of G and G* (see Proposition 
(2.1). Let a,b,c be the nodes of px. For every node i € V, let F(i) denote the set 
of bounded countries containing i, and for every country p, let V(p) denote the set 
of nodes on the boundary of p. Let U = VUV*\po, and let J denote the set of 
pairs ip with p € V*\poo and i € V(p). 

Let us start with assigning a positive real number r, to every node u € U. 
Think of this as a guess for the radius of the circle representing u (we do not guess 
the radius for the circle representing poo; this will be easy to add at the end). For 
every ip € J, we define 
(5.1) Qip = arctan be and Gp, = arctan =. 

V4 Tp 
We can think of these numbers as guesses for the angles of the right triangle formed 
by the centers of circles representing a node, an adjacent dual node, and the inter- 
section point of these circles. It is clear that 
(5.2) Kip tapi = = 

Now suppose that the circles with radii r,, form a proper double circle repre- 
sentation, in which the triangle abc is regular. Consider a node i € V. For every 
country p incident with i, 2a;, is the angle of the corner of p at 7 (Figure [5.6). 
Since these corners fill out the available angle around 7, we have 


Tr, ifi e V\ {a,b,c}, 
ee) ye =e ee if i € {a,b,c}. 
pe F(i) ’ Ys 


We can derive similar conditions for the countries: 
(5.4) S- Ap, = 7 (p € V*\ poo). 
teV(p) 


The key to the construction of a double circle representation is that these conditions 
are sufficient. 
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FIGuRE 5.6. A right triangle in the double circle representation 
and its parameters 


Lemma 5.5. Suppose that the radii r,, > 0 (u € U) are chosen so that the angles 
defined by (5.1) satisfy (5.3)and (5.4). Then there is a double circle representation 


with these radii. 


Proof. Construct two right triangles with legs r; and rp for every ip € J, one 
with each orientation, and glue these two triangles together along their hypotenuse 
to get a kite Kj. Starting from node a of poo, put down all kites Kap (p € F(a)) 
in the order of the corresponding countries in a planar embedding of G. By (5.3), 
these will fill an angle of 7/3 at a. Now proceed to a bounded country p; incident 
with a, and put down all the remaining kites K,; in the order in which the nodes 
of p; follow each other on the boundary of p;. By (5.4), these triangles will cover a 
neighborhood of p;. We proceed similarly to the other bounded countries containing 
71, then to the other nodes of p;, etc. The conditions and guarantee that 
we tile a regular triangle. 

Let C; be the circle with radius r; about the position of node i constructed 
above, and define D, analogously. We still need to define D,,. It is clear that poo 
is drawn as a regular triangle, and hence we necessarily have rg = ry = Tc. We 
define D,,, as the inscribed circle of the regular triangle abc. 

It is clear from the construction that we get a double circle representation of 


G. 


Of course, we cannot expect conditions and to hold for an arbitrary 
choice of the radii r,, (equation is valid for any choice of positive real numbers 
ru). In the next sections we will develop a method to construct radii satisfying 
the conditions in Lemma [5.5} but first we need two simple lemmas, going in this 
direction, but not far enough. 

For a given assignment of radii r,, (uw € U), consider the violation of the condi- 
tions in Lemma [5.5] To be precise, for u € U, we define its defect as follows: 


S- Qup—T, ifueV\ {a,b,c}, 
peF(u) ~ 
(5.5) p= S- Qup— & if u € {a,b,c}, 
peF(u) 
Qpu-T, if ue V*\Dpoo. 
i€V(u) 
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(These defects may be positive or negative.) While of course an arbitrary choice 
of the radii r,, will not guarantee that all these defects are 0, the following lemma 
shows that this is true at least “on the average”: 


Lemma 5.6. For every assignment of radii, we have 


Y= 6. 


ucU 


Proof. From the definition, 


de bu = = ee + S- bz Qip— — 


weU i€V\{a,b,c} \pEF( i€{a,b,c} \pEF (i) 


i do i-™ 


pEV*\Dpoo \iEV(p) 


Every pair ip € J contributes ajp+ap; = 7/2. Since |J| = 2m—3, we get 


(2m 3)5 (n—3)n 37 (f—1)t = (m—n—f4+2)0 


By Euler’s formula, this proves the lemma. 


Next we prove that conditions (5.3) and (5.4), considered as linear equations 
for the angles a;,, can be satisfied. (We do not claim here that the solutions are 
obtained in the form (5.1).) 


Lemma 5.7. Let G be a planar map with a triangular unbounded country poo = abc. 
Then there are real numbers 0 < Bip < 1/2 (ij € J) satisfying 


ee ifie V\{a, b,c}, 
a Bip = 6 if i € {a,b,c}. 


and 
7 * 
> (F-fie) =" (PE V*\P a0). 
i€V(p) 
Proof. Consider the Tutte rubber band embedding of the graph, with a, b,c 
nailed to the vertices of a regular triangle. For i € V(p), let 6,; denote the angle 
of the polygon p at the vertex 7. Then the conclusions are easily checked. 


5.1.2. An almost-proof. To prove Theorem [5.3} we want to prove the exis- 
tence of an assignment of radii so that all defects are zero. Let us start with an 
arbitrary assignment; we measure its “badness” by its total defect 


D= > bul. 
ucU 

How to modify the radii so that we reduce the total defect? The key observation 
is the following. Let i be a node with positive defect 6; > 0. Suppose that we 
increase the radius r;, while keep the other radii fixed. Then a;, decreases for 
every country p € F(i), and correspondingly a,; increases, but nothing else changes. 
Hence 46; decreases, 6, increases for every p € F'(i), and all the other defects remain 
unchanged. 
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Since the sum of (signed) defects remains the same by Lemma 5.6] we can say 
that some of the defect of node ¢ is distributed to its neighbors (in the graph (U, J)). 
(It is more difficult to describe in what proportion this defect is distributed, and 
we will try to avoid to have to explicitly describe this). 

It is clear that the total defect does not increase, and if at least one of the 
countries incident with 7 had negative defect, then in fact the total defect strictly 
decreases. 

The same argument applies to nodes in V*. Thus if the graph (U, J) contains 
a node with positive defect, which is adjacent to a node with negative defect, then 
we can decrease the total defect. 

Unless we already have a double circle packing, there certainly are nodes in U 
with positive defect and other nodes with negative defect. What if these positive- 
defect nodes are isolated from the negative-defect nodes by a sea of zero-defect 
nodes? This is only a minor problem: we can repeatedly distribute some of the 
defect of positive-defect nodes, so that more and more zero-defect nodes acquire 
positive defect, until eventually we reach a negative-defect node. 

Thus if we start with an assignment of radii that minimizes the total defect, 
then all defects must be 0, and (as we have seen) we can construct a double circle 
representation from this. 

Are we done? What remains to be proved is that there is a choice of positive 
radii that minimizes the total defect. This kind of argument about the existence of 
minimizers is usually straightforward based on some kind of compactness argument, 
but in this case it is more awkward to prove this existence directly. Below we 
give a different proof, whose idea is less transparent, but where the existence of a 
minimizer will be easy to prove. 


5.1.3. A more sophisticated objective function. The proof of Theorem 
to be described here is from {Colin de Verdiére 1991]. It might help to follow 
the argument below if we point out that we work with the variables x; = Inr;. 

Proof of Theorem [5.3} For « € R, define 

g(a) = / arctan(e’) dt. 


—co 


It is easy to verify that the function ¢ is monotone increasing, strictly convex, and 
T T 
(5.6) gltl+ < o(2) < 1+ 5lel+ 


for alla ER. 
Let z € RY. Using the numbers £;, from Lemma consider the function 


P(x) = > (b( ap — 24) — Bip(tp —4)). 


ipEeT 


It follows from the convexity of ¢ that ® is convex; it is not strictly convex, because 
it is constant if all differences x, —ax,; remain constant. But it is easy to see that 
this is the only direction in which it is not strictly convex: if we fix x, = 0, then it 
will be a strictly convex function of the remaining variables. 
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Furthermore, ®(x) + co if aq = 0 but |x| > oo. This follows easily from (5.6): 
T 
O(xr) = S- (O(2p — 21) — Bip (tp — 24) > S- (5 |Zp — Zil4 — Bip(Lp w)) 


ipeT ipEed 


a (Brilep— ails + Bip|atp— |) > min{ Bip, By |p — 24 
ipEeT ipEeT 
(where 8p; = 7/2— Bip). Since Bip, Bp; > 0, each term is nonnegative, and any given 
term tends to infinity if the corresponding difference |x; —x,| tends to infinity. If xq 
remains 0 but: |a| > 00, then (due to the connectivity of G®), at least one difference 
|x; --p| must tend to infinity, and so (x) — oo. 
It follows from these properties of ® that it attains its minimum at some point 
y € RY, and here 
O® 
5.7 ——(y) =0 EU). 
(5.7) 52)=0 (wed) 
We define the radii r; = e”* (i € U), and the values a;, according to (5.1). To com- 
plete the proof, it suffices to check that these radii and angles satisfy the conditions 
of Lemma 
Let 7 € V, then 


mt dH Wp-¥ a Bip =— Y 7 aretan(e””)+ S7 Bip, 


pEF (i) pEF(i pe F (i) pEF (i) 


and so by (5 
(5.8) y Qip = > arctan * = > arctan(e4? 4) = =, Big: 


peF(i) pe F(t) peF (i) peF(i) 


Using the definition of the numbers (, this proves . Condition (5.4) follows by 
a similar computation. 


Proof of Corollary The existence of the representation follows by The- 
orem In the proof above, the fact that ® is strictly convex (if x, = 0 is fixed) 
implies that the minimizer of ® is unique, and grad ®() = 0 happens at this unique 
point only. The computation in the proof above shows that every double circle rep- 
resentation (with ra = ry = re fixed) gives a point x where grad ®(2) = 0, which 
shows that the double circle representation is essentially unique. It is not hard 
to extend this argument to the case when we fix three arbitrary mutually tangent 
disks to represent the nodes of the unbounded country. 


Remark 5.8. The proof described above provides an algorithm to compute a 
double circle representation (up to an arbitrary precision). It is in fact quite easy 
to program: We can use an “off the shelf” optimization algorithm for smooth convex 
functions to compute the minimizer of F' and through it, the representation. 


5.2. Formulation in space 


5.2.1. Double cap representation. We can lift the Koebe or double circle 
representation of a graph from the plane to the sphere. The advantage will be that 
we do not have to deal with the exceptional unbounded country, and there will be 
a perfect duality between a graph and its dual. The price we pay is that we have 
to leave the more familiar realm of plane geometry for spherical geometry. 
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FiGuRE 5.7. A planar map, its representation by touching circles, 
and a stereographic projection of the latter to the sphere. 


On the sphere, we represent the nodes of the graph and those of its dual by 
“caps”. It will be convenient to specialize some notions from Appendix to 
the ordinary 2-dimensional sphere. A circle on a sphere cuts the sphere into two 
regions; together with one of the regions, the circle forms a cap. The cap may cover 
a hemisphere or even more; if it covers less than a hemisphere, we call it proper. A 
great circle (like the equator) bounds two hemispheres, which are not proper caps 
by this definition. Every other circle bounds one proper and one improper cap. 

Note that in connection with a cap, we can talk about more than one notions 
of “center”. To be more precise, let S be the unit sphere, and let C’ be a cap on 
S with boundary circle C’. The circle C’ is the intersection of the sphere with a 
plane, and in this plane it has a center a. The line through the origin orthogonal to 
this plane intersects the cap at a point c, which we call the center of the cap C. If 
C is proper, then there is another useful point p € @ to consider, which is the point 
from which the portion of the sphere that is visible is exactly the cap C’. The circle 
C’ is the “horizon” from p. Clearly a, c and p are on one line through the origin, 
and by elementary geometry, |a|-|p| = |c|? = 1. We call p the pole of the cap C. 

We define a double cap representation of a planar map G as two families of caps 
ona sphere S, (C;: i€ V) and (D,: p € V“*), so that for every edge ij, bordering 
countries p and q, the following holds: the interiors of the caps C; are disjoint, and 
so are the interiors of the caps D,; the caps C; and C; are tangent at a point uj,;; 
the caps D, and D, are tangent at the same point u,;; and the boundary circles of 
D, and Dg intersect the boundary circles of C; and C; orthogonally. 

Such a double cap representation provides an embedding of the graph in the 
sphere, by representing each node 7 by the center of the cap C;, and connecting 
adjacent nodes i and 7 by an arc of a large circle through the tangency point uj,;. 
Similarly we get an embedding of the dual graph. Note that in this spherical setting, 
the primal and dual graphs are completely interchangeable. 

Similarly as in the plane, these conditions imply other useful properties of the 
sphere and the embedding: Every country p is bounded by a convex spherical 
polygon, and the cap D, touches every edge of p. Every country p is covered the 
caps D, and C; (i € V(p)). If i € V is not incident with p € V*, then C; and D, 
are disjoint. 

We have not excluded caps covering more than a hemisphere in the definition 
of a double cap representation; if all caps in a double cap representation are proper, 
we call the representation proper. 
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Theorem 5.9. Every 8-connected planar map G has a proper double cap represen- 
tation on the sphere. 


Proof. Let G be a 3-connected planar map, and let p, be its unbounded 
country. Let (C;: i¢€ V) and D,: p € V*) be a double circle representation of G 
in the plane. Take any sphere S touching the plane, and project the plane onto the 
sphere by inverse stereographic projection. Let (C{: i¢ V) and Di: p¢ V*) be 
the images of the circles in the double circle representation; these are circles by the 
fundamental properties of stereographic projection. We define caps (Ci: i € V) 
and D, : p€ V*) with boundaries C; and D,,, respectively: we assign the cap not 
containing the North pole to every circle exept to Dp,,, to which we assign the 
cap containing the north pole. This way we get two families of caps on the sphere. 

Clearly the cap Cc ; is the image of the disk bounded by C;, and similar assertion 
holds for D;, except that t Doss is the image of the exterior of Dp. (plus the north 
pole). Hence the caps C; are nonoverlapping, and so are the caps D;. All the 
properties of the double cap representation follow easily from the adirespondine 
properties of the double circle representation. 

If we want a proper representation, all we have to do is to choose the sphere S$ 
appropriately: we let it touch the plane at the center c of D,,,, in which case Dp: 
will be a cap centered at the north pole. The size of this cap depends on the radius 
of S. If the radius of S is half of the radius of D,,, then Dy. is just the northern 
hemisphere. We choose the radius of S a bit smaller than this, so that De. be 
contained in the interior of the northern hemisphere, but just barely. All other caps 
Dis as well as the caps C; for i ¢ V(poo) are properly contained in the complement 
of the interior of Do and so if Dy. is sufficiently close to the whole northern 
hemisphere, then these caps will be proper. A cap C; for i € V(poo) contains 
neither the North pole nor the South pole (since the South pole is the center of the 
circle D,,., which is orthogonal to C; and hence its center is an exterior point of 
C;). This implies that C, is smaller than a hemisphere, and so the representation 
is proper. 


5.2.2. The Cage Theorem. One of the nicest consequences of the coin rep- 
resentation (more exactly, the double cap representation in Theorem |5.9) is the 
following theorem [Andre’ev 1970al, which is a far reaching strengthening of the 
Steinitz Representation Theorem. 


Theorem 5.10 (The Cage Theorem). Every simple 3-connected planar graph 
can be represented as the skeleton of a convex 3-polytope such that every edge of it 
touches a given sphere. 


Proof. The nice picture of a double cap representation translates into polyhe- 
dral geometry as follows. Let (Ci: i€ V) and (D,: p € V*) be a proper double 
cap representation of G. Let u; be the pole of C;, and let v, be the pole of Dp. 
Let ij € E, and let pq be the corresponding edge of G*. The points u; and u; 
are contained in the tangent line of the sphere that is orthogonal to the circles Ci 
and Ci at their common point u,,;; this line is clearly the common tangent of the 
circles Di, and Dj at uij. The plane v,u = 1 intersects the sphere in the circle Dj, 
and hence it contains it tangents, in particular the points u; and uj, along with all 
points uz where k is a node of the facet p. Since the cap DB, is disjoint from Ck if 
k is not a node of the facet p, we have V,UK < 1 for every such node. 
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This implies that the polytope P = conv{u; : 7 € V} is contained in the 
polyhedron P’ = {u € R®: v,u <1Vp € V*}. Furthermore, every inequality 
v,u < 1 defines a facet F, of P with vertices u;, where 7 is a node of p. Every ray 
from the origin intersects one of the countries p in the drawing of the graph on the 
sphere, and therefore it intersects F,. This implies that P has no other facets, and 
thus P = P’. It also follows that every edge of P connects two vertices u; and uj, 
where 77 € E, and hence the skeleton of P is isomorphic to G and every edge is 


tangent to the sphere. 


FIGURE 5.8. A convex polytope in which every edge is tangent to 
a sphere creates two families of circles on the sphere. 


Let us note that, conversely, every polytope P whose edges are tangent to the 
unit sphere creates a double cap representation of its skeleton. It is convenient to 
assume that the center of the sphere is in the interior of P; this can be achieved 
by an appropriate projective transformation of the space (we come back to this in 
Section : 

Each node in V is represented by the cap of points you see on the sphere when 
looking from the corresponding vertex, and each facet is represented by the cap its 
plane cuts off the sphere (Figure [5.8). Elementary geometry tells us that the caps 
representing adjacent nodes touch each other at the point where the edge touches 
the sphere, and the two caps representing the adjacent countries also touch each 
other at this point, and they are orthogonal to the first two circles. Furthermore, 
the interiors (smaller sides) of the horizon-circles are disjoint, and the same holds 
for the facet-circles. 


5.2.3. Mobius transformations. We have seen that the double circle repre- 
sentation of a planar graph is uniquely determined, once a triangular country of G 
or G* is chosen as the unbounded country, and the circles representing the nodes 
of this triangular country are fixed. Similar “essential uniqueness” holds true for 
double cap representations on the unit sphere. 

To formalize what we mean by that, observe that we can apply an arbitrary 
circle-preserving transformation of the sphere to any double cap representation, to 
get another one. But these modifications are the only ones to worry about; this is 
asserted by the following spherical version of Corollary [5.4] 


Proposition 5.11. The double cap representation of a 3-connected planar graph 
is unique up to circle-preserving transformations of the sphere. 
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Proof. Let (C;,D,: i € V,p € V*) and (Cj,D,: 1 € V,p € V*) be two 
double cap representations of the same 3-connected planar graph G. One of G 
and G* has a triangular country; let (say) px = abc be triangular country of G. 
We can apply a circle preserving transformation to the representation (C;, D7) to 
obtain that Cl = Ca, C, = Cy and Cl = C, (see Exercise [5.6). This implies 
that Dp. = 1 Choosing the center of Dp, as the north pole, and applying 
stereographic projection, we get two double circle representations of G in the plane 
with poo as the unbounded country and with the same circles representing a, b and 
cin both. Uniqueness of the extension to a double circle representation (Corollary 
5.4) implies that these are the same, and hence (C;,D, : i € V,p € V*) and 

C;,D,: 1€V,p € V*) are the same. 


A nice application of this last fact is a proof of the following theorem 


Mani 1971): 


Corollary 5.12. Every 3-connected planar graph G has a representation as the 
skeleton of a polytope P such that every automorphism of the graph extends to an 
isometry of the polytope. 


Proof. Consider the double cap representation of G. If a is an automorphism 
of G, then relabeling the nodes according to a gives another representation of G, 
using the same caps, just representing different nodes. This representation can 
be obtained by applying a cycle preserving transformation @ to the sphere, and 
by Proposition this transformation is uniquely determined. It follows that 
transformations of the form @, where a is an automorphism of G, form a finite 
subgroup of the group of cycle preserving transformations. It is well known that 
every such finite transformation group is conjugate to a group of isometries, which 
means that after an appropriate circuit-preserving transformation, automorphisms 
will extend to isometries. 


While the double cap representation of a planar graph is “essentially unique”, 
we can make good use of the richness of Mobius transformations, by “normalizing” 
the representation in an appropriate way. A natural normalization goal is that the 
center of gravity of the centers of the caps should be the origin. The reason why 
such a normalization is nontrivial is that when applying a Mobius transformation, 
the caps are mapped onto caps, but the centers of the caps are not necessarily 
mapped onto the centers of their images. In general, it is difficult to compute the 
appropriate transformation directly; to prove its existence, we have to use topology. 
The topological argument, on the other hand, has the advantage that it applies to 
a number of similar normalization demands without any essential change. 

As a preparation, let us discuss the topological tool we need. Brouwer’s Fixed 
Point Theorem implies that if we have a continuous map f : B — B whose 
restriction to the boundary sphere S' is the identity, then the center of B is in the 
range of f. We need an extension of this fact to the case when f is only defined 
in the interior B’ of B. Suppose that the range of f : B’ — B does not cover 
the origin. Trivially, the extension of f to the boundary of B as the identity map 
cannot be continuous; the next lemma say that it is quite a bit discontinuous. 


Lemma 5.13. Let f: B’ > B be a continuous map, and suppose that the origin 
is not in the range of f. Then there are points x, € B’ (k =1,2,...) andy eS 
such that x, > y and f(x;)° > —-y. 


5.3. CIRCLE PACKING AND THE RIEMANN MAPPING THEOREM 69 


Proof. Since the range of f does not contain the origin, the mapping f;(x) = 
—tf(x)° is well defined for 0 < t < 1 and x € B’, and it is a continuous map 
tB > tB. So by Brouwer’s Fixed Point Theorem, it has a fixed point p,, satisfying 


pi = —tf(pr)”. 
We may select a sequence of numbers t, € (0,1) such that t, — 1 and pi, > 
y € B. Since |pz,| = |tef(pe,)°| = te > 1, we have y € S. Furthermore, 


f(Pi,)° = —(1/tk) Pt, > —y- 


Lemma 5.14. Let {Ci,...,C,} be a family of nonoverlapping caps on the sphere S. 
Then there is a Mobius transformation T of the sphere such that v,+---+vVn = 0, 
where v; is the center of T(C;). 


Proof. Excluding trivial cases, we assume that n > 3. For every interior point 
x of the unit ball, we define a conformal (circle-preserving) transformation Tx of the 
unit sphere. For x = 0, we define 7, = ids. If x £0, then we take a tangent plane 
T at x°, and project the sphere stereographically onto T, shrink the plane from 
center x° by a factor of 1—|x|, and project it back stereographically to the sphere. 
It is easy to check that this construction has the following property: if x, € Bisa 
sequence of points such that x, > y € S, then Tx, (z) > y for every zE S, zy. 

Let v;(x) denote the center of the cap 7;(C;) (warning: this is not the image of 
the center of C; in general!), and let f(x) = (1/n)(vi(x)+--:+vn(x)). We want 
to show that the range of f contains the origin. Suppose not, then by Lemma|p.13} 
there are points x, € B’ and y € S such that x; — y and f(x,)° > —y as k > o0. 

Let us think of y as the north pole and —y as the south pole. From the 
properties of 7, it follows that if C; does not contain the south pole, then v;(x,) > y 
as k — oo; if the south pole is on the boundary of C;, then v;(x,) tends to the 
equator. So with at most two exceptional values of i, we have y'v;(x,) > 1, 
and it there are two exceptional values i and j, then y'v;(x,),y'v;(xx) + 0. So 
y'f (xz) > 0 if k is large enough, which contradicts the fact that f(x;,)° > —y. 


Remark 5.15. The topological proof above is quite robust: instead of the center 
of gravity, it applies to other “centers” satisfying some natural conditions. To be 
more precise, consider a continuous function f : Sx---xS > B (n factors of 
the unit sphere S$) with the property that y'f(y1,...,yn) > 0 if y; = y for at 
least n—2 indices 7. Let {Ci,...,Cn} be a family of nonoverlapping caps on the 
sphere S' (n > 4). Then there is a Mébius transformation 7 of the sphere such that 
f(vi,---,;Vn) = 0, where v; is the center of T(C;). (See Exercise [5.13] for a slightly 
stronger statement.) 


5.3. Circle packing and the Riemann Mapping Theorem 


Koebe’s Circle Packing Theorem and the Riemann Mapping Theorem in com- 
plex analysis are closely related. More exactly, we consider the following general- 
ization of the Riemann Mapping Theorem, also due to Koebe: 


Theorem 5.16 (Circle Domain Theorem). Every connected open domain on 
the sphere whose boundary has a finite number of connected components is confor- 
mally equivalent to a domain obtained from the sphere by removing a finite number 
of disjoint disks and points. 
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The Circle Packing Theorem and the Circle Domain Theorem are mutually 
limiting cases of each other |Koebe 1936), |[Rodin—Sullivan 1987]. The exact proof 
of this fact has substantial technical difficulties, but it is not hard to describe the 
idea. 


1. To see that the Circle Domain Theorem implies the Circle Packing Theorem, 
let G be a connected map on the sphere. We may assume that G and its dual G* are 
3-connected, and their arcs are smooth curves (these assumptions are not essential, 
just convenient). Let « > 0, and let U denote the open ¢-neighborhood of G*. By 
Theorem|5.16] there is a conformal map of U onto a domain D’ which is obtained by 
removing a finite number of disjoint caps and points from the sphere (the removed 
points can be considered as degenerate caps). If ¢ is small enough, then these caps 
are in one-to-one correspondence with the nodes of G. We normalize using Lemma 
[5.14]and assume that the center of gravity of the cap centers is 0 (Figure [5.9]shows 
this correspondence drawn in the plane). 


FIGURE 5.9. The Koebe Circle Domain Theorem, applied to a 
small neighborhood of the edges of a planar map, yields a touching 
disk representation of the dual map. 


Letting « + 0, and selecting a suitable subsequence, we may assume that the 
cap representing any given node v € V converges to a cap C,. One can argue that 
these caps are nondegenerate, caps representing different nodes tend to nonover- 
lapping caps, and caps representing adjacent nodes tend to caps that are touching. 
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FIGURE 5.10. Coin representation of a triangular grid, and of the 
graph obtained by adding a new node (the black node on the top), 
the latter converging to a conformal mapping of a triangle onto a 
disk. 


2. In the other direction, let us warm up by describing how to get a conformal 
map from a regular triangle to a circular disk. (Here we argue in the planar setting.) 


5.4. APPLICATIONS OF COIN REPRESENTATIONS 71 


We subdivide the interior of the triangle into “many” small triangles to get piece 
of a triangular grid (Figure 5.10). This graph G has an obvious representation by 
touching circular disks. The union of these disks approximates the original triangle, 
so nothing is gained here. 

However, if we add a new node v and connect it to all the boundary nodes of G, 
we get a triangulation. Representing this triangulation as a tangency graph, with 
the circle representing v turned inside out, we get the picture on the right: we can 
think of it as the middle picture distorted so that the outer disks touch the circle 
C' representing the new node. We can map the full triangle into the interior of C 
by mapping each little triangle affinely to the corresponding triangle on the right. 
Refining the grid, we get a sequence of maps that (with some care) converge to a 
conformal map of the interior of the triangle to the interior of the circle. 

Turning to the general case, and returning to the spherical formulation, let U = 
S?\ Ki\::-\Kn, where Ky,..., Ky are disjoint closed connected sets which do not 
separate the sphere. We may assume that no K; is a single point (these are trivial 
to handle). Let « > 0. It is not hard to construct a family C(e) of nonoverlapping 
caps such that the radius of each cap is less than ¢ and their tangency graph G is 
a triangulation of the sphere. 

Let H; denote the subgraph of G consisting of all edges intersecting K;. If ¢ is 
small enough, then the subgraphs H; are node-disjoint, and each H; is nonempty. 
It is also easy to see that the subgraphs H; are connected. 

Let us contract in G each subgraph H; to a single node w;. The spherical map 
G’ obtained this way can be represented as the tangency graph of a family of caps 
{Du(e): ue V(G"’)}. We can normalize so that the center of gravity of the centers 
of Dy, (€),...,; Dw, (€) is the origin. 

Now let ¢ > 0. By selecting and appropriate subsequence, we may assume 
that each Dy, = Dw,(e) tends to a cap Dy,(0). Furthermore, we have a map f- 
that assigns to each node u of G- the center of the corresponding cap D,. One 
can prove (but this is nontrivial) that these maps f., in the limit as « > 0, give a 
conformal map of U onto $?\ Dw,(0)\--+\ Du, (0). 


5.4. Applications of coin representations 


5.4.1. Planar separators. Koebe’s Theorem has several important applica- 
tions. We start with a simple proof of a version of the Planar Separator Theorem 


Lipton—Tarjan 1979): 


Theorem 5.17. Every planar graph G on n nodes contains a set S CV such that 
|S| < V2n, and every connected component of G\ S has at most 3n/4 nodes. 


This version and the proof to be presented is due to [Miller et_al. 1997]; see 
for an improved analysis of the method. 

We need the notion of the “statistical center”, which is important in many 
other studies in geometry. Let S C R®% be a set of n points in R¢. A point c € R4 
is a statistical center of S if every closed halfspace containing c contains at least 
n/(d+1) elements of S. (We could say open halfspace instead of closed without 
changing this notion.) For d = 1, this point is just the median of a finite set of real 
numbers. This is unique if the set has an odd number of elements, otherwise the 
allowable values form an interval. 


Lemma 5.18. Every finite set of points in R@ has a statistical center. 
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Proof. Let B be a ball containing all points in S, and let H be the family of 
intersection of B with those closed halfspaces that contain more than dn/(d+1) 
points of S. Then all members of H are closed, bounded and convex. Furthermore, 
the intersection of any d+1 of these still contains an element of S,, so in particular it 
is nonempty. Thus by Helly’s Theorem, the intersection of all sets in H is nonempty. 
We claim that any point c € NH satisfies the conclusion of the lemma. 

If H be any open halfspace containing c, then R¢\ H ¢ H, which means that 
HT contains at least n/(d+1) points of S. If H is a closed halfspace containing c, 
then it can be enlarged slightly to an open halfspace H’ that intersects S in exactly 
the same set. We know already that H’ contains at least n/(d+1) elements of S, 
and hence so does H. 


Proof of Theorem |[5.17} Let (C; : i € V) be a Koebe representation of G on 
the unit sphere, and let u; be the center of C; on the sphere, and p;, the spherical 
radius of C;. 

We want to make the origin a statistical center of the points u;. This takes 
a little care, since a set may have more than one statistical center. It is easy to 
see that statistical centers of the set form a convex set, and we can consider the 
center of gravity of all statistical centers; let us call this (just for this argument) 
the central statistical center. It is also easy to see that the central statistical center 
depends continuously on the position of the points of S', and satisfies the conditions 
formulated in Remark So we can transform the Koebe representation into 
one whose central statistical center is the origin. This also implies that all caps C; 
are proper. 

Take any plane H through 0. Let S denote the set of nodes z for which Ci 
intersects H, and let H separate V\S into two sets S; and S9. Clearly there is no 
edge between S; and $2, and so the subgraphs G and G2 are disjoint and their 
union is G\S. Since 0 is a statistical center of the u;, it follows that |.S1|,|S2| < 
3n/4. 

It remains to make sure that S' is small. To this end, we choose H at random, 
and estimate the expected size of S. 

What is the probability that H intersects C;? Since the caps C; are proper, 
we have p; < 7/2 for every 7. By symmetry, instead of fixing C; and choosing H 
at random, we can fix H and choose the center of C; at random. Think of H as 
the plane of the equator, then C; will intersect H if and only if its center is at a 
latitude at most p; (North or South). The area of this belt around the equator is, 
by elementary geometry, 47 sin p;, and since the total area of the sphere is 47, the 
probability that the center of C; falls into here is sin p;. It follows that the expected 
number of caps C; intersected by H is })jcy sin pi. 

To get an upper bound on this quantity, we use the fact that the surface area 
of the cap C; is 47 sin(p;/2)?, and since these caps are disjoint, 


(5.9) » (sin a AA; 


Using that sin p; < 2sin 4, we get by Cauchy-Schwarz 


1/2 ae 
S/ sin pi <Jn (Semon?) <2/n (= (sin *) < 2y/n. 


tEeV iEeV iEeV 
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So the expected size of S is less than 2,/n, and so there is at least one choice of H 
for which |S| < 2\/n. 


5.4.2. Laplacians of planar graphs. The Planar Separator theorem, dis- 
cussed in the last section, was first proved by direct graph-theoretic arguments, and 


other elegant graph-theoretic proofs are available |Alon—Seymour—Thomas 1984}. 


However, for the following theorem on the eigenvalue gap of the Laplacian (Fiedler 


value) of planar graphs |Spielman—Teng 1996a], there is no proof known avoiding 


Koebe’s theorem. Informally, the theorem says that planar graphs are poor ex- 
panders; see Section [B.3] for more on the significance of the eigenvalue gap. 


Theorem 5.19. For every connected planar graph G on n nodes and maximum 
degree D, the second smallest eigenvalue of Lg is at most 8D/n. 


Proof. By Raleigh’s formula, the second smallest eigenvalue of Lg is given by 


Let C; : 7 © V be a Koebe representation of G on the unit sphere, and let u; be 
the center of C;, and p;, the spherical radius of C;. By Lemma [5.14] may assume 
that >>, uj = 0. 

Let u; = (ui1, Ui2, Ui3), then x; = wiz must be taken into account in this formula 
for k = 1,2,3, which implies that 


2 2 
s (Wik —Ujk)” > Az s UsK 
ijeEE ieV 


holds for every coordinate k, and summing over k, we get 


(5.10) S > Jui—uyl? > A255 Jus? = Aan. 


ij€E ieV 


On the other hand, we have 


2 

ju;—uj|? = 4 (sin ntti) =4 (sin C cos & sin cos ft) 
“2 2 2 
<4 (sin +sin ) < 8 (sin &) +8 (sin er) ‘ 


and so by (5.9) 
2 . pi\? 
Y> ju;—u,)? <8D5 (sin 2) < 8D. 


ijEE ieV 


Comparison with (5.10) proves the theorem. 


Among other applications of Koebe’s Theorem of similar nature, we men- 
tion a bound on the cover time of the random walk on a planar graph 


Jonasson—Schramm 2000}. 


74 5. COIN REPRESENTATION 


5.5. And more... 


5.5.1. General convex domains and bodies. Let H be a family of closed 
convex domains in the plane, such that their interiors are disjoint. If we represent 
each of these domains by a node, and connect two of these nodes if the corresponding 
domains touch, we get a Gz, which we call the tangency graph of the family. It is 
easy to see that if no four of the domains touch at the same point, then this graph 
is planar. 

In what follows, we restrict ourselves to the case when the members of H are 
all homothetical copies of a centrally symmetric convex domain. It is natural in 
this case to represent each domain by its center. 


FIGURE 5.11. Straight line embedding of a planar graph from 
touching centrally symmetric convex figures. 


In [Schramm 1991] the following deep converse of this construction was proved. 


Theorem 5.20. For every smooth strictly conver domain D, every maximal planar 
graph can be represented as the tangency graph of a family of homothetical copies 
of D. 


In fact, Schramm proves the more general theorem that if we assign a smooth 
convex domain D; to each node 7 of a triangulation G of the plane, then there is 
a family D = (Di: i € V) of nonoverlapping convex domains such that D/ is 
positively homothetical to D; and the tangency graph of D is G. 

The smoothness of the domain is a condition that cannot be dropped. For 
example, AK4 cannot be represented by touching squares. A strong result about 
representation by touching squares, also due to Schramm, will be stated and proved 
in Chapter |6| 

The Cage Theorem has a far-reaching generalization to general convex 
bodies [Schramm 1993]. 


Theorem 5.21 (Caging the Egg). For every smooth strictly convex body K in 
R°, every 3-connected planar graph can be represented as the skeleton of a polytope 
in IR° such that every one of its edges touch K. 


5.5.2. Polyhedra in hyperbolic space. If we have a double cap representa- 
tion of a planar graph, we can read off another interesting polyhedral representation, 
not of the graph itself, but of its medial graph, and not in Euclidean space, but in 
hyperbolic space. 
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Let (C;: i € V) and (D,: p € V*) form a proper double cap representation 
of G on the sphere S. Let B be the ball bounded by S. We can represent each cap 
C;, as the intersection of a ball A; with S, where the boundary sphere A’, of A; is 
orthogonal to S. Similarly, we can write D, = B,MS, where By is a ball whose 
boundary sphere By, is orthogonal to S. 

Let P denote the set of points in B that are not contained in any A; and By 
This set is open and nonempty (since it contains the origin). The balls A; and B, 
cover the sphere S, and even their interiors do so with the exception of the points 
x;; where two circles C; and C;; touch. It follows that P is a domain whose closure 
P contains a finite number of points of the sphere 9. It also follows that no three 
of the spheres A; and B, have a point in common except on the sphere S. Hence 
those points in the interior of S that belong to two of these spheres form circular 
arcs that go from boundary to boundary, and it is easy to see that these arcs are 
orthogonal to S. For every incident pair (i,p) (¢ € V,p € V*) there is such an 
“edge” of P. The “vertices” of P are the points x;;, which are all on the sphere S, 
and together with the “edges” as described above they form a graph isomorphic to 
the medial graph G™ of G. 

All this translates very nicely, if we view the interior of S as a Poincaré model 
of the 3-dimensional hyperbolic space. In this model, spheres orthogonal to S$ are 
“planes”, and hence P is a polyhedron. All the “vertices” of P are at infinity, and 
P is a Steinitz representation of Gt in hyperbolic space. Every dihedral angle of 
P is 1/2. 

Conversely, a representation of G; in hyperbolic space as a polyhedron with 
dihedral angles 7/2 and with all vertices at infinity gives rise to a double cap 
representation of G. 

What about representing not the medial graph, but an arbitrary planar graph 
by such a polyhedron in hyperbolic space? This is not always possible any more. 
gave a general necessary and sufficient condition 
for the existence of a representation of a planar graph by a polyhedron in hyperbolic 
3-space with “vertices” at infinity and with prescribed dihedral angles. From this 
condition, he was able to derive Theorems[5.10] and [5.3] 


5.5.3. Other angles. The double circle representation of G gives a represen- 
tation of the lozenge graph G® by circles such that adjacent nodes correspond to 
orthogonal circles. We may want to extend Koebe’s Theorem to construct represen- 
tations of more general graphs by orthogonal circles: circles representing adjacent 
nodes must intersect at 90°, other pairs should be disjoint or intersect at an angle 
larger than 90° (i.e., their centers must be farther apart) (Figure [5.12h. 

More generally, we may want to prescribe the angle at which two circles repre- 
senting adjacent nodes meet. Circles touching from the outside correspond to 180°, 
orthogonal circles, to 90°. Most results in this generality concern planar triangu- 
lations. We quote the main result, due to and 
‘Thurston 1997], without proof. See also for a proof 


along the lines of the proof of Koebe’s Theorem given above. 


Theorem 5.22. Let G be a simple planar triangulation different from K4. Let 
an angle 0 < Wi; < 1/2 be assigned to every edge ij. Suppose that the following 
conditions hold: 


(i) If ijk is a triangle in G such that Wi; +Ujn+Vei = 7, then ijk is a country; 
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FIGURE 5.12. Representing a planar graph by orthogonal circles. 


(i) If igkl is a quadrilateral in G such that Wij + jr +Vutvu = 27, then ijkl 
is the boundary of the union of two countries. 


Then G has a representation by circles (C;: i € V) on the 2-sphere such that 
for every edge ij, the circles C; and C; intersect at an angle of Wi;. 


In the special case when all prescribed angles are 7/2, a necessary and sufficient 
condition was proved in |Kotlov-Lovasz—Vempala 1997). 


Theorem 5.23. Let G be a simple planar triangulation different from K4. Then 
G has a representation by orthogonal circles on the sphere if and only if no 3- or 
4-cycle separates it into two parts with at least two nodes. 


With a little care, one can do the representation so that each node is represented 
by a proper cap. 


Exercise 5.1. Let G be a connected planar map with a triangular unbounded 
country abc. With the notation of Section [5.1.1] define J[S] = {ip € J: i,pe S} 
(S CU), and f(S) = 4|J[S]| —|$|+2|Sn {a,b c}|. Prove that (a) f(0) = f(U) = 
0; (b) f(S) <0 for every set 9 CS CU. 


Exercise 5.2 (Continued). Prove that if (ru : wu © U) and (ri, : u €U) are 
two assignments of positive radii such that they give rise to the same defect at 
every node, then there is a constant C > 0 such that rj = Cr; for all i. 


Exercise 5.3 (Continued). (a) Prove that \>.56u > f(S)m for every set 
6 cS CU. (b) Prove that if (dy : u € U) is a vector of real numbers such 
that }7,du =0 and S) eg du > f(S)m for every set 0 C S CU, then there is an 
assignment of positive radii whose defects are 6. = dy for all u € U. (c) Give a 
new proof of Theorem [5.3] based on these assertions. 


Exercise 5.4. Prove that every double circle representation of a planar graph is 
a rubber band representation with appropriate rubber band strengths. 

Exercise 5.5. Prove that in every double cap representation of a 3-connected 
planar graph on the sphere, at most one cap is improper. 

Exercise 5.6. Prove that for three mutually tangent caps C), C2 and C3 on the 
sphere, and three other mutually tangent caps C{, C} and C4, there is a Mobius 
transformation that maps C; onto C{ (i = 1, 2,3). 

Exercise 5.7. Show by an example that the bound in Lemma is sharp. 


Exercise 5.8. Let H be a family of convex domains in the plane with smooth 
boundaries and disjoint interiors. Then the tangency graph of H is planar. 


5.5. AND MORE... 


Exercise 5.9. Construct a planar graph G whose nodes cannot be represented 
by squares in the plane so that the interiors of the squares are disjoint and two 
squares share a boundary point if and only if the corresponding nodes of G are 
adjacent. 


Exercise 5.10. Prove that every simple planar graph can be represented as the 


tangency graph of a family of triangles |de Fraysseix-de Mendez—Rosenstiel 1994). 


Exercise 5.11. Let H be a family of homothetical centrally symmetric convex 
domains in the plane with smooth boundaries and disjoint interiors. Prove that 
the centers of the domains give a straight line embedding in the plane of its 


tangency graph (Figure|5.11). 


Exercise 5.12. Prove that in a representation of a 3-connected planar graph 
by orthogonal circles on the sphere the “interiors” can be chosen so that the 
corresponding caps cover the whole sphere. 


Exercise 5.13. Let f: Sx---xS — B (n factors of the unit sphere S) be a 
continuous function with the property that y'f(yi, ...;¥n) > Oif either y; = y for 
all i, or yi; = y with the exception of one index 7 = 71, for which y;, = —y, or yi = 
y with the exception of two indices 7 = 71,72, for which y Vi; = y Vis = 0. Let 
{Ci,...,Cn} be a family of nonoverlapping caps on the sphere S' (n > 4). Prove 
that there is a Mobius transformation 7 of the sphere such that f(v1,...,Vvn) = 0, 
where v; is the center of 7(Ci). 


Exercise 5.14. Let G be a simple planar map, and consider the graph H = 
GUG®, obtained from G° by adding the edges of the original graph. Let us 
prescribe an angle of 90° to the edges of G°, and 0° to the edges of the original 
graph. Show that a representation of H by circles with these prescribed angles 
yields a double circle representation of G. 

Exercise 5.15. Let G be a connected planar graph on n nodes with maximum 
degree D. Prove that the second largest eigenvalue of the transition matrix of the 
random walk on G is at least 1—8D/n. 


Exercise 5.16. Show by an example that Theorem does not remain valid if 
we drop the conditions of smoothness and strict convexity. 
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CHAPTER 6 
Square Tilings 


We can also represent planar graphs by squares, rather then circles, in the 
plane. There are in fact two quite different ways of doing this: the squares can 
correspond to the edges, a classic result [Brooks et_al. 1940], or the squares can 
correspond to the nodes, a more recent result [Schramm 1993}. 


6.1. Electric current through a rectangle 


The classical paper used a physical model of electrical cur- 
rents to show how to relate square tilings to planar graphs. The ultimate goal was 
to construct tilings of a square with squares whose edge-lengths are all different. 
This will not be our concern (but see Exercise (6.2); we will allow squares of the 
same size and also the domain to be tiled can be any rectangle, not necessarily a 
square. 

Consider a tiling 7 of a rectangle R with a finite number of squares, whose 
sides are parallel to the coordinate axes. We can associate a planar map with this 
tiling as follows. Let us call a maximal horizontal segment composed of edges of 
the squares a long edge. Represent every long edge by a single node. Each square 
“connects” two horizontal segments, and we can represent it by an edge connecting 
the two corresponding nodes, directed top-down. We get a directed graph G7 
(Figure (6.1), with a single source s (representing the upper edge of the rectangle) 
and a single sink t (representing the lower edge). 

It is easy to see that graph G7 is planar: it can be obtained by first consid- 
ering the midpoints of the horizontal edges of the squares, connecting two of them 
horizontally if they are neighbors along a long edge, and vertically if they belong 
to opposite edges of the same square. This graph is clearly planar, and contracting 
the horizontal edges, we get Gr. 

A little attention must be paid to points where four squares meet; we call these 
points 4-fold corners. Suppose that squares A, B,C,D share a corner p, where 
A is the upper left, and B,C,D follow counterclockwise. In this case, we may 
consider the lower edges of A and B to belong to a single long edge, or to belong 
to different long edges. In the latter case, we may or may not imagine that there 
is an infinitesimally small square sitting at p, which may “connect” A with C or 
B with D (Figure (6.2). What this means is that we have to declare if the four 
edges of Gz corresponding to A, B, C and D are adjacent to the same node, 
two nonadjacent nodes, or two adjacent nodes. We can orient this horizontal edge 
arbitrarily. Deciding between these four possibilities will be called “resolving” the 
4-fold corner p. 

If we assign the edge length of each square to the corresponding edge, we get a 
flow f from s to t: If a node v represents a segment J, then the total flow into v is 
the sum of edge lengths of squares attached to J from the top, while the total flow 
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FIGURE 6.1. Constructing a graph with a harmonic function from 
a tiling of a rectangle by squares. 


ce Be 


ty At ae aK 


1 


FIGURE 6.2. Possible resolutions of four squares meeting at a point. 


out of v is the sum of edge length of squares attached to J from the bottom. Both 
of these sums are equal to the length of J. 

Let h(v) denote the distance of node v from the upper edge of R. Since the 
edge-length of a square is also the difference between the y-coordinates of its upper 
and lower edges, the function h is harmonic: 

1 
h(i) = =~ >», At) 
deg (2) sens 
for every node different from s and t (Figure [6.1). 
It is not hard to see that this construction can be reversed. 


Theorem 6.1. For every connected planar map G with two specified nodes s and 
t on the unbounded country, there is a unique tiling T of a rectangle such that 
(resolving the 4-fold corners appropriately) G= Gr. 


Proof. Consider the harmonic function f : V — R with f(s) = 0 and f(t) = 
(obtained, say, as the 1-dimensional rubber band embedding with s and ¢ nailed; in 
Figure |6.3} this is the vertical coordinate of each node). We assign a square to each 
edge uv with f(v) > f(u), of side length f(v)—f(u). This square will be placed 
so that its lower edge is at height f(u), and its upper edge, at height f(v). To 
find the horizontal position of these squares, we start from node s: we line up the 
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FicurRE 6.3. A planar graph and the tiling generated from it. 


squares corresponding to the edges incident with s along the bottom line, in the 
order as these edges emanate from s. We go through the nodes in the increasing 
order of the values of f. Getting to a node v, those edges entering v from below 
have squares assigned to them whose top edges are at height f(v), and these edges 
form a segment I, of length 


l= >) (F(e)-F()). 
soesee 


Since by the harmonic property of f we also have 
IbKl= So (F@-Fl)), 
ueN(v) 


f(uj2f(vr) 


so we can line up the squares corresponding to edges exiting v upwards, along I,, 
in the order given by the embedding in the plane. When we get to t, we have filled 
up the rectangle. 


FIGURE 6.4. The dodecahedron graph, its rubber band embedding 
on the line (horizontally distorted to show the structure), and a 
square tiling generated from it. 
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6.2. Tangency graphs of square tilings 


Let R be a rectangle in the plane, and consider a tiling of R by squares. Let us 
add four further squares attached to each edge of R from the outside, sharing the 
edge with R. We want to look at the tangency graph of this family of squares. 

Since the squares do not have a smooth boundary, the conclusion of Exercise 
[5.8] does not apply, and the tangency graph of the squares may not be planar. Let 
us try to draw the tangency graph in the plane by representing each square by its 
center and connecting the centers of touching squares by a straight line segment. 
Similarly as in the preceding section, we get into trouble when four squares share 
a vertex. In this case we can specify arbitrarily one diametrically opposite pair 
as “infinitesimally overlapping”, and connect the centers of these two square but 
not the other two centers. We call this a resolved tangency graph of the family of 
squares. 

Every resolved tangency graph is planar, and it is easy to see that it has exactly 
one country that is a quadrilateral (namely, the unbounded country), and its other 
countries are triangles; briefly, it is a triangulation of a quadrilateral (Figure [6.5). 


10 


FIGURE 6.5. The resolved tangency graph of a tiling of a rectangle 
by squares. The numbers indicate the edge lengths of the squares. 


Under some connectivity conditions, this fact has the following converse 


Schramm 1993}. 


Theorem 6.2. Every planar map in which the unbounded country is a quadrilat- 
eral, all other countries are triangles, and is not separated by a 3-cycle or 4-cycle, 
can be represented as a resolved tangency graph of a square tiling of a rectangle. 


Schramm proved a more general theorem, in which separating cycles were al- 
lowed; the prize to pay was that degenerate squares with edge-length 0 had to be 
allowed. It is easy to see that a separating triangle forces everything inside it to 
degenerate in this sense, and so we do not loose anything interesting by excluding 
these. Separating 4-cycles may or may not force degeneracy (Figure (6.6), and it 
does not seem easy to tell when they do. 
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degenerate 
square 


FIGURE 6.6. A separating 4-cycle that causes degeneracy and one 
that does not. 


Before proving Theorem [6.2] let us make a detour to a well-known game. The 
following folklore fact is easy to prove: 


Proposition 6.3. Let G be a planar map in which the unbounded country is a 
quadrilateral abcd and all other countries are triangles. Let us 2-color the nodes 
with black and white, so that a and c are black and b and d are white. Then either 
there is an all-black a-c path, or an all-white b-d path, but not both. 


Proof. From the planarity of G it is immediate that if an all-black a-c path 
exists, then it separates b and d, and so it excludes an all-white b-d path. To prove 
that one of these paths exists, let us start a walk through the map, avoiding the 
nodes. We enter through the edge ab, so that we have a black node to the left 
and a white node to the right, and leave through another edge that has differently 
colored endpoints (clearly again black on the left and white on the right). Going 
on similarly, we maintain that we cross only edges having a black endnode to our 
left and a white endnode to our right. It is easy to see that we never return to 
a triangle which we left earlier, and so we must return to the unbounded country 
through one of the edges bc or ad: Exiting through cd is impossible, since then we 
would have the wrong colors on our left and right (Figure|6.7} left). 

Suppose (say) that we exit through bc; then the black-black edges of those 
triangles that we cross form a walk from a to c, which contains a black a-c path. 


&f 


FIGURE 6.7. Left: Walking through a 2-colored triangulation of a 
square. Right: The game of Hex as a special case of the situation 


in Proposition 


The reader familiar with basic algebraic topology will notice that the proof 
above is very similar to one of the (algorithmic) proofs of Sperner’s Lemma. In fact 
it would be easy to derive Proposition [6.3] from Sperner’s Lemma. 
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A special case of this proposition is the fact that in every game of Hex, one or 
the other player wins, but not both (Figure [6.7] right). 


Proof of Theorem Let G be the planar map with unbounded country 
abcd. It will be convenient to set V’ = V\{a,b,c,d}. Recall that V’(P) denotes 
the set of inner nodes of the path P. For a path P and two nodes u and v on 
the path, let us denote by P(u,v) the subpath of P consisting of all nodes strictly 
between u and v. Later on we will also need the notation Plu, v) for the subpath 
formed by the nodes between u and v, with u included but not v, and P[u, v] for 
the subpath with both included. 

We consider the node-path and node-cut polyhedra, where a and c are the spe- 
cial nodes and b and d are deleted, or vice versa (see Appendix[C.2.1). Proposition 
[6.3] implies that the a-c node-cut polyhedron of G\{b,d} is the same as the b-d 
node-path polyhedron of G'\ {a,c}, and vice versa. Another way of saying this is 
that the b-d node-path polyhedron of G\ {a,c} is the blocker of the a-c node-path 
polyhedron of G \ {b, d}. 

Let P be the set of a-c paths in G\{b,d} and Q, the set of b-d paths in 
G\ {a,c}. The a-c node-cut polyhedron is described by the linear constraints 


(6.1) ty 20 (weV’) 
(6.2) a(V’(P))>1 (PEP). 
Consider the solution % of (6.1)—(6.2) minimizing the objective function )>,, x2, 
and let R? be the minimum value. By Proposition|C.2| y = (1/R?)z minimizes the 
same objective function over the blocker (which is the b-d node-path polyhedron 


of G\{a,c}), and the optimum value is 1/R?. Let us rescale the optimizers to get 
z= RE = Ry. Then we have 


(6.3 2(V'(P)) > z (P €P) 
6.4) 2(V'(Q))=>R (QEQ) 
(6.5) ae =1. 

ueV’ 


It will be convenient to define z, = z = % = Za = 0. 

We assign the length 2; = $(zi+z;) to every edge ij. Using this, we can 
define (as usual) the length 2(P) of path as the sum of lengths of its edges, and 
the distance d,(u,v) of two nodes as the minimum length of any path connecting 
them. Inequalities and say that d.(a,c) > 1/R and d,(b,d) > R, and 
the minimality of © and Y implies that we have equality here. It is easy to see that 
d,(i,j) = 2; for every edge ij, so the edge is a shortest path between its endpoints. 

We know that 7 is in the a-c node-path polyhedron, and it is a minimal vector 
in there, so it is in the a-c node-path polytope, and it can be written as a convex 
combination of indicator vectors 1p of sets V’(P), where P is an a-c path. It 
follows that z can be written as 


(6.6) z= S>dpIp, So\P=R, AP ZO. 
PEP P 


Similarly, we have a decomposition 


1 
(6.7) z= > pela, S HQ = R Ma 2 0. 
QeQ Q 
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Let P’ ={P EP: Ap > O}, and define Q’ analogously. Obviously, a node 
u €V’ has z, > 0 if and only if it is contained in one of the paths in P’, which 
must then be equivalent to be contained in one of the paths in Q’ (we'll prove later 
that all nodes have z, > 0). 

From conditions (6.3)-(6.5) we can derive some simple but powerful properties 
of the paths in P’ and Q’. It is trivial from the topology of G that |V(P)NV(Q)| > 1 
for P € P’ and Q € Q’. On the other hand, implies that 


t= 2 = (rete) (Sele) =D renelV(PNV(Q) 
ueV! P Q P,Q 
> S> APHe = 63 Ap) 63 10) =1. 
P,Q P Q 


We must have equality here, which implies that 
(6.8) V(P)NV(Q)|=1 (PEP, QeQ’. 
For any path Q € Q’, we have 


(6.9) 2(Q)=2"lg = S> API pla = D> APIV'(P)NV'(Q)| = So AP =P. 


PEP! PEP’ PEP’ 


Similarly, for every P € P’, we have 
(6.10) 2(P) = —. 


So the paths in P’ and Q’ are shortest paths with respect to the metric d,. It 
follows that they are chordless, and also for every path P € P’ and u,v € V(P), the 
subpath P[u,v] is a shortest path between its endpoints. We get by the same kind 
of argument that the common nodes of any two paths P, P’ € P’ are encountered 
by the two paths in the same order. 

A consequence of is that a b-d path Q satisfies 2(Q) = R (so it is a shortest 
b-d path) if and only if |V(P)NV(Q)| = 1 for every P € P’. 

We say that a P’ € P’ is to the right of P € P’, if every node of P’ is either 
a common point of P and P’, or is separated from b by P. Clearly this defines a 
partial ordering of P’. In this case we say that P and P’ do not cross. We can 
similarly define noncrossing paths in Q’, and a partial ordering. 

We continue with two somewhat more elaborate properties of the paths in P’ 
and Q’. The first of these arguments could be omitted at the cost of making the 
arguments later less transparent. However, under the name of “uncrossing”, this 
method is a standard and powerful step in many proofs in graph theory, and it is 
worth describing. 


Claim 1.We can choose the decompositions in and (6.7) so that the families 


P' and QO are pairwise non-crossing. 


In other words, “to the right” defines a total order on P’. Similarly, we get a 
total order on Q’. 

To prove the Claim, let P,P’ € P’ be a pair of paths that are crossing. Using 
that their common nodes are in the same order along both, we can construct two 
other a-c paths Py and Pj, consisting of the common nodes and those nodes of 
each path that are to the left (resp. to the right) of the other path (Figure [6.8] 
left). Clearly 2(P)+2(P') = 2(Po)+2(P3), and since P and P’ satisfy with 
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equality, so do Pp and Pj. Let ¢ = min{Ap, Ap’}, and let us decrease Ap and Ap: 
by € and increase Ap, and A PI by ¢. This gives another decomposition in (6.6). So 
one of P and P’ (say, P’) drops out of P’, and Po and P} enter it (they may have 
been there already, in which case only their weight increases). 

This describes the main step, but we want to repeat this procedure to get rid 
of all crossings; to this end, we have to justify that we are making progress. This 
is not quite obvious, since we have replaced two paths by (possibly) three, creating 
new crossings with the other paths. Looking more carefully, we see that 

(a) the remaining path P and the new paths Py and Pj are mutually noncross- 
ing, and 


(b) if a further path in P’ crosses either one of Pp and P}, then it crosses at 
least one of P and P’; and if it crosses both Pp and P}, then it crosses both P and 
P’. 

From these observations it follows that the sum 

dy, pdp, 
P,,P2€P’ crossing 


decreases at the above operation, and so if we start with a decomposition minimizing 
this sum, then the family P’ will consist of non-crossing paths. We argue for Q’ 
similarly. This proves the Claim. 


FIGURE 6.8. Left: uncrossing two paths. Right: every node has 
positive weight. 


Let u € V’. Every path in P’ either goes through wu or it separates u from 
exactly one of 6 or d in the topology of the plane. Let P;, P,, and P} denote the 
sets of paths in P’ separating u from b, passing through wu, and separating u from d, 
respectively. Clearly the sets P,, P,, and P,* form intervals in the ordering of P’. 
We define the partition QO’ = QO, UQ,,UQ7 analogously: the paths in Q~ separate 
u from a etc. If Q € Q’ is any path through wu, then it is easy to tell which paths 
in P’ separate u from b: exactly those whose unique common node with Q lies on 
the subpath QJ[b, uw). 

Clearly A(Pu) = (Qu) = zu. We can express the distance of a node u € V’ 
with z, > 0 from b as 


(6.11) d,(b, u) = S +X(P,, ). 
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Indeed, choosing any Q € Q,, a path P € P’ belongs to P, if and only if it 
intersects Q(b, u), and such paths contribute Ap to 2, for a single node v on Q(b, u). 


Claim 2. Every nodeic V’ satisfies z; > 0. 


Suppose that there are nodes with z; = 0, and let H be a connected component 
of the subgraph induced by these nodes. There are no separating 3- and 4-cycles 
by the hypothesis of the theorem, hence H has at least five neighbors in V\ V(#). 
Each neighbor v of H has weight z, > 0, and so it is contained in a path P, € P’ 
as well as in a path Q, € Q’. The path P, is disjoint from H and separates H 
either from 6 or from d, and similarly, the path Q, separates H from either a or 
c. Thus there are two neighbors u and v of H so that both P,, and P, separate H 
from (say) b, and both Q,, and Q, separate h from (say) a. We choose the same 
path for P, and P, if possible, and similarly for Q,, and Q,, but implies that 
we cannot succeed in both cases. So we may assume that Qy 4 Qo. 

The rest of the argument consists of analyzing the topology around H. If 
P,, = Py, then we may assume that u comes before v along the path P,. The path 
Q» separates u from a, so P,, meets Q, before u, and then meets it again at v, 
contradicting (6.8). 

Suppose that P,, #4 P,. Then, as before, it follows that P,,[a,u) meets Qy. 
Since P,, and Q, have only one intersection point, it follows that Q, is disjoint 
from P,,[u,c]. Similarly, P, is disjoint from Q,,[u, d]. 

The paths P,, and Qy cross each other at wu and split the interior of abcd into 
four open parts (Figure [6.8] right). The subgraph H is contained in the part T 
incident with the edge cd, and since v is a neighbor of H and not on the paths P,, 
and Q,,, it must also be in T. Now the path P,|a,v] must enter T through a node 
of P,,(u, c], since it is disjoint from Q,,[u, d]. Similarly, the path Q,[b, v] must enter 
T though Q.(u,d]. But then P,[a,v] and Q,[b,v] must have an intersection point 
outside T, different from v, contradicting (6.8). 


After all this preparation, we can describe the squares representing G. We 
start with the rectangle R with one vertex at (0,0) and the opposite vertex at 
(R,1/R). Every node u € V’ will be represented by a square S$, centered at 
Pu = (dz(a,u),dz(b,u))" and having edge length z,,. We represent node a by the 
square S, of side length R attached to the top of SR. We represent 6b, c and d 
similarly by squares attached to the other edges of K. 


Claim 3. Leti,j ¢ V. Ift and j are adjacent, then S; and S; are tangent. If i 
and j are nonadjacent, then S; and S; are either disjoint or tangent along a single 
vertex. 


The Claim is easily checked when i and/or j belong to {a, b,c, d}, so suppose 
that 7,7 € V’. 

First, let i7 € E(P), where (say) P € P’. Assume that P encounters i before 
j. Then d,(a,j) = dz(a,i)+2,; (since P is a shortest path). On the other hand, 
let (say) dz(b,7) > dz(b,2), and let Q € Q’ go through 7 and Q’ € Q’ go through 
j. Then Q” = Q[b, 7] U {77} UQ’[j, d] is a b-d path, and since Q” intersects P in at 
least two nodes, it is not a shortest b-d path. Thus z(Q”) > R, which implies that 


d.(b, 4) = 2(Q[b, i) > R—-2(Q', d]) — %j = 2(Q"[b, j)) — 23 = de (0, 7) — %j- 


So |d,(b, 7) —d-(b,i)| < %;. This proves that the squares 5; and S; are tangent 
along horizontal edges. 
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Second, let ij € E(G), and assume that 7j is an not an edge of any path in 
P'UQ". It is clear that P;P; = 0, since for any path in P’ passing through both 
i and j, the edge 77 would be a chord. Since P; and P; are intervals in the ordering 
of P’, it follows that (say) P; is completely to the left of P;. Since i and j are 
adjacent, no path in P’ can separate i and j, and hence NP; =P; UP;. Applying 
for both 7 and j, we see that 


ay ay 


(6.12) d(b, j)—dz(b,4) = F-—F +P; ) MPP) = F—F+APs) 
1 
= 5 (%i +2). 


Finally, assume that 7 and j are nonadjacent nodes. If there is a path Q € Q’ 
going through both 7 and 7, encountering (say) 7 before 7, then 
dz(b, j) = dz(b, i) +dz(i, J) > dz(b,i) + 


a + ay 


and hence S; and S; are disjoint. We conclude similarly is there is a path in P’ 
through both i and j. We are left with the case when P;| NP; = Q;:NQ; =. As 
in the previous proof, we may assume that P; is completely to the left of P;, but 
we can only conclude that VP; CP; UP;, and so the computation in gives 
an inequality only: dz(b,j)—dz(b,i) > $(zi+2;). Similarly, |dz(a, j)—dz(a,i)| > 
(2; +2;). If strict inequality holds in both directions, then S; and $; are disjoint; 
if equality holds in both cases, then S; and S; have a vertex in common. This 
proves Claim 3. 


Consider G as drawn in the plane so that node i is at the center p; of the 
square 5;, and every edge 7j is a straight segment connecting p; and p;. This gives 
a planar embedding of G. Indeed, we know from Claim 3 that every edge 77 is 
covered by the squares S; and S;. This implies that edges do not cross, except 
possibly in the degenerate case when four squares S;,5;,S,,5; share a vertex (in 
this clockwise order around the vertex), and ik, jl © E. Since the squares S; and 
S; are touching along their vertical edges, we have 77 € EL. Similarly, jk, kl, li € E, 
and hence i, j,k, ! form a complete 4-graph. But this is impossible in a triangulation 
of a quadrilateral that has no separating triangles. 

Next, we argue that the squares S; (i € V \{a, b,c, d}) tile the rectangle R = 
(0, R] x [0,1/R]. It is easy to see that all these squares are contained in the rectangle 
MR, and they are non-overlapping by Claim 3. The total area covered by the squares 
is Vey, 2? = 1, which is just the area of ®, so they must cover 9. 

Finally, we show that an appropriately resolved tangency graph of the squares 
S; is equal to G. By the above, it contains G (where for edges of type (iii), the 
4-corner is resolved so as to get the edge of G). Since G is a triangulation of the 
outside quadrilateral, the containment cannot be proper, so G' is the whole tangency 
graph. 


Exercise 6.1. Figure out how to resolve the common points of four squares in 
the tiling in Figure [6.2] in order to get the dodecahedron graph. 

Exercise 6.2. Verify that the graph in Figure|6.9]gives rise to a tiling of a square 
by different size squares. (This construction provides the minimum number of 
different squares tiling a square.) 


Exercise 6.3. Prove that if G is a resolved tangency graph of a square tiling of 
a rectangle, then every triangle in G is a country. 
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FIGURE 6.9. Tiling a square with the smallest number of different squares. 


Exercise 6.4. Construct a resolved tangency graph of a square tiling of a rec- 
tangle that contains a quadrilateral with a further node in its interior. 


Exercise 6.5. Let G be a maximal planar graph with at least 4 nodes, with 
unbounded country abc. Let us 2-color the nodes different from a, b and c with 
black and white. Prove that either there is a black node connected by black paths 
to each of a, 6 and c, or there is a white node connected by white paths to each 
of a, b and c, but not both. 
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CHAPTER 7 


Discrete Analytic Functions 


We have seen in Chapter[4]that the notion of harmonic functions in analysis has 
an analogue in graph theory, and this analogue has proved very useful in the study 
of electrical networks, rubber band embeddings, random walks, and elsewhere. In 
this chapter we discuss an extension of analytic functions to graphs; for this to be 
meaningful, we have to consider graphs embedded in orientable surfaces (maps). 

In analysis, a complex analytic function is just a pair of real harmonic functions 
connected by the Cauchy-Riemann equations. We know what harmonic functions 
are on graphs, but what is the “right” analogue of the Cauchy-Riemann equations? 

The motivation comes from numerical analysis. In order to do numerical com- 
putations with an analytic function, we have to restrict its domain to a finite set, 
most naturally to a large piece of a fine enough grid. Then we can mimic dif- 
ferentiation by taking the difference of values on adjacent gridpoints, integration 
by summation along grid paths etc. In this setting, important properties, like the 
Cauchy—Riemann equations and the Cauchy Integral Theorem, will only remain ap- 
proximately true. Ferrand and later Duffin suggested to turn this around: use only 
approximate values of the function on the grid, but require that appropriate forms 
of (say) the Cauchy Integral Theorem should hold exactly. The resulting theory 
of discrete analytic functions on the grid leads to interesting results, even though 
some properties of analytic functions seem to be irrevocably lost (for example, the 
product of analytic functions will be no longer analytic). 

In fact, we study two different, but closely related notions: discrete analytic 
functions, and discrete holomorphic forms (in graph theory terms, these will be 
called rotation-free circulations). We have seen that no nonconstant function can 
be harmonic everywhere on a finite graph, and accordingly discrete analytic func- 
tions are best defined on countable graphs embedded in the plane; on the other 
hand, discrete holomorphic forms can be best studied on finite graphs embedded 
in compact orientable surfaces. 

Discrete analytic functions on the square grid were introduced in the 
1940’s and studied quite extensively later on [Duffin 1956], 
For the case of a general map, the notion of discrete analytic functions 
is implicit in the classic paper (cf. also Section in 
this book) and more recent work [Benjamini-Schramm 1996a). These func- 
tions were formally introduced by [Mercat 2001], and applications were given 
in [BenjaminiLovasz 2002), and, perhaps most: impor: 
tantly, in [Smirnov 2010b]|Chelkak-Smirnov 2012], where they were used in the 
proof of the conformal invariance of the Ising model on the square grid (to 
be discussed in more detail in Chapter (8). Similar, but different theories of 
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discrete analytic functions (not discussed here) were developed in [Isaacs 1952), 
Dynnikov—Novikov 2003), |Kiselman 2005). 


7.1. Warmup: discrete analytic functions on the grid 


7.1.1. Grid graphs and lattice domains. We consider the square grid as 
a (countable) map in the complex plane, whose node set is the set G of Gaussian 
integers (complex numbers a+ bi, where a, b € Z), and two such nodes are connected 
by an edge if and only if their difference is +1 or +7. We are going to denote, with 
a little abuse of notation, this graph by G. Its dual graph G* can be obtained from 
G by shifting it by (1+7)/2. Let G, (r = 0,1) denote the set of Gaussian integers 
a+bi with a+b = r (mod 2). Note that G,. is a grid itself, where we consider 
two lattice points adjacent if they are nearest, which in this case means that their 
difference is +(1+7) or £(1—1). 

The graph G is bipartite, with bipartition Gp UG; sometimes we are going to 
call the nodes in Go “black”, and those in G,, “white”. So the origin is black. 
We can also 2-color the bounded countries of G (the nodes of G*) with black and 
white (like on a chessboard); to be specific, let each square get the same color as 
its Southwest corner. 

We will also need to define an orientation of these graphs. There are two natural 
orientations we can consider: we can orient each edge so that the country on its 
left is black (say), or we can orient it from the black endnode to the white endnode. 
We call these two orientations the cyclic and acyclic orientation, respectively. If we 
start with the acyclic orientation of G, and define the orientation of G* as described 
in Chapter [2| then we get the cyclic orientation of G* (Figure [7.1] left). 


eee Sane nee ean ee 


FIGuRE 7.1. Left: A cyclic orientation of the planar grid and the 
corresponding acyclic orientation of its dual. Right: a lattice do- 
main D and the graphs Gp and Gj. Note that the doubled edge 
is not an edge of Gp, so Gp is not an induced subgraph of G. 


The subset of the plane bounded by a simple closed polygon in G will be called 
a lattice domain (Figure [7.1] right). The vertices and edges of the lattice graph G 
in a lattice domain D form a subgraph Gp. Note that Gp is not necessarily an 
induced subgraph of G; but Gj, consisting of all nodes representing squares whose 
interior if contained in the interior of D, is an induced subgraph of G*. This is the 
tangency graph of its squares (which we do not resolve into a triangulation, unlike 
in Chapter {6}: we connect the centers of two squares contained in D if and only if 
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they share an edge. This graph Gj, can be obtained from the dual graph of G by 
removing the node representing the unbounded country of G. 


7.1.2. Discrete analytic functions. A discrete analytic function is a func- 
tion f: G—C, satisfying the equation 


(7.1) f(z+i)— f(z4+1) = i(f(z+1+4)- f(z)) 
for every z € G. This equation has a nice geometric meaning: the diagonals of the 
image of any lattice square under a discrete analytic function are orthogonal and 
of equal length. We can write this equation as 
(7.2) f(z) tif(zt)+?f(zti4+ D+ f(z+i) =0. 
Dividing by i—1, we get 

f(ztit+D—f(2) _ fe+)—fE+V) 
(7.3) : = : 

i+1 t—1 

This latter equation can be thought of as a discrete version of the fact that the 
directional derivative of a complex analytic function is the same in all directions; 
in other words, it gives a discrete version of the Cauchy—Riemann equation. 

We can allow more general discrete analytic functions, namely functions defined 
on the lattice points in a lattice domain. In this case, we stipulate equation 
for all lattice squares contained in this domain. 

Discrete analytic functions form a linear space over C, i.e., linear combinations 
of them is discrete analytic again. It is trivial to check that every linear function, 
when restricted to Gaussian integers, is a discrete analytic function. Furthermore, 
the restriction of any quadratic polynomial to the Gaussian integers is a discrete 
analytic function. It suffices to verify this for the function z?, which is straightfor- 
ward: 


(z+1)?—(z+i)? = 2(¢-1)z4+2 =i((z+i+1)?-27), 


verifying (see Figure[7.2). 


FIGURE 7.2. The image of the standard grid under the discrete 

analytic function z+ z?. Every vertex other than the origin is the 

image of two lattice points, negatives of each other. 
In contrast to the previous example, the restriction of the function f(z) = 2° 
to the Gaussian integers is not discrete analytic. In fact, substituting z = 0 in 
(7.1), we get 3-13 = —i—1 on the left, while i((1+7)? —0°) = —2(1+7) on the 
right. This shows that the product of two discrete analytic functions is not discrete 
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analytic in general. This fact, which is in strong contrast to the continuous case, is 
a major difficulty in this theory. 

Some other operations on “true” analytic functions do work out quite well. Let 
f be a function on the Gaussian integers. Let P = (20, 21,..-, 2k) be a walk on the 
grid, ie., ze41—2% € {£1, ti}. We define the “line integral” 


. f 


hk 
[1 jaz = Sot (241 aa I) toy z;). 
j 


j=0 


The nice fact about this integral is that for analytic functions, it is independent of 
the path P, depends on its endpoints only. More precisely, let P and P’ be two 
walks on the grid, with the same beginning node and endnode. Then 


(7.4) [ro dz = [re dz 
P P! 
This is equivalent to saying that 
(7.5) [re dz = 
P 


if P is aclosed walk. It suffices to verify this for the case when P is a simple polygon, 
which we can recognize as a discrete version of the Cauchy Integral Theorem. The 
proof is quite easy: the case when P is the boundary of a single square is just a 
rearrangement of equation (7.3). For a general lattice polygon P, follows by 
summing over all lattice squares in the interior of P. 

It follows that we can write ee f(z) dz, without specifying the path from u to 
v along which we integrate. Furthermore, it is straightforward to verify that for a 
fixed u, ‘es f dz is a discrete analytic function of v. 

For a “true” analytic function, its real and imaginary parts are harmonic func- 
tions. For discrete analytic functions, this connection is a bit more complicated. 
Let f be a discrete analytic function on the grid G, and let f, denote its restriction 
to G,. (r = 0,1). It is easy to see that the functions fo and f; determine each other 
up to an additive constant. 


The function f, is a harmonic on G,. Indeed, for z € Go (say) we have 
fo(z+i+1)— fo(z) = i(fo(z+1)— fo(z+4)), 
fo(z+i—1)— fo(z) = i(folz+4) — fo(z—D), 
fo(z—4-1) — fo(z) = i(fo(z—1)— fo(z—2)), 
fo(z—4+1)— fo(z) = i(fo(z—#) — fo(z+1)). 


Summing these equations, we get 
fo(e+i+1) + fo(z+i—1)+ folz—-—t—-1)+ fo(z—*+1)—4fo(z) = 0 


which means that fo is harmonic at z. 
Conversely, let fo be a harmonic function on Gg. By the potential argument, 
we can find a function f; on G; such that 


fi(z+1)— filz+i) = —t(fo(z+i+1)— fol(z)) 
filz+i)—filz—1) = —-é(fo(z+i-1)— fo(z)) 
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for every z € Go. It is easy to see that f; is harmonic as well, and foU fi is a 
discrete analytic function on the grid. 

Let us also notice that a complex valued harmonic function is simply a pair 
of real valued harmonic functions (its real and imaginary parts). Furthermore, the 
real part of f; is related to the imaginary part of fo, and vice versa. Hence it 
suffices to study analytic functions that are real on Go and imaginary on G;. We 
can forget about the factor 7 on G;, and consider a pair of functions f : Go > R 
and g: G; > R such that the following version of the Cauchy—Riemann equations 
holds: 

(7.6) glz+1)—glz+t) = fletit+1)—flz) 
g(2+i)—g(2-1) = fle+i-1)- Fle) 
We call the pair (f,g) an analytic pair on the grid. 

We can define discrete analytic functions on the nodes of Gp for some lattice 
domain D, instead of all lattice points. Condition is then required only for 
lattice squares contained in D. Properties of discrete analytic functions discussed 
above extend to this more general setting easily. 

Further constructions like analytic polynomials on the grid, and some properties 
of discrete analytic functions, similar and not similar to properties of “true” analytic 
functions, are described in Exercises|7.1 


7.1.3. Strongly analytic functions. Smirnov introduced a 
class of functions satisfying a stronger condition than analycity. Consider a lattice 
domain D and the graph Gp with the cyclic orientation. We say that a function g 
defined on D is strongly analytic, if 


g(v)—g(u) || Vo-u 
for every edge uv € Gp. (Note that the sign of the square root plays no role, but 
the orientation of the edge is important: 0—W is orthogonal to Wu—W.) Strongly 


analytic means that g(z+1)—g(z) is real if z € Gp, and imaginary otherwise; 
g(z+7)—g(z) is parallel to 1+7 if z € Go, and to 1—7 otherwise (Figure [7.3] left). 


FIGURE 7.3. Left: directions of differences along edges of a 
strongly analytic function. Right: the image of a lattice square 
under a strongly analytic function. (The quadrilateral may be ro- 
tated by any multiple of 45°.) 


It is easy to figure out the shape of the image of a (say, black) square (Figure 
right). The image of a lattice square (assuming it is nondegenerate) is always 
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concave, but never self-intersecting; if its two shorter edges (at the concave angle) 
have lengths a and b, then the two longer ones have a+ 2b (next to b) and b+ V2a 
(next to a). In particular, the sums of squared length of opposite edges are equal: 
(a+ V2b)? +a? = (b+ V2a)?+b? = 2a?+2V/2ab+2b?. Three vertices of it form 
a triangle with one angle of 45°, and the fourth point is the orthocenter of this 
triangle. It also follows that the diagonals of the quadrilateral are perpendicular 
and of equal length. Checking the directions of the diagonals too, we see that 


g(z+i)—g(z+1) = i(g(z+i+1)—g(2)), 
which implies that every strongly analytic function is analytic (justifying the name). 


Example 7.1. Consider the function z + z on G; this is analytic, but not strongly 
analytic. We get further analytic functions by adding a complex number a at every 
lattice point in G,. If |a| = 1, then it is easy to check that the concave quadrilateral 
(0,1+a,1+i,i+a) (the image of the lattice square (0,1,1+7,2) has the property 
that its opposite edges are perpendicular, and so are its diagonals. By elementary 
geometry, it follows that the angles of this quadrilateral are, in order, 45°, 225°, 45°, 
45°. If we divide all function values by 1+a, then the edges of this quadrilateral 
will be parallel, in order, to 1,1—7,2,1+7. Thus this function is strongly analytic. 
In formula: 


A if z€ Go, 
+a 
92) =4 z4+4 (jal = 1). 
ifzEeG 
l+a 


See Figure 4 


FIGURE 7.4. The image of the grid under a strongly analytic function. 


Exercise[7. I]tells us that prescribing the values of a discrete analytic function on 
the coordinate axes determines the function uniquely. Analogously, if we prescribe 
the value of a function at the diagonal points «+7, then it has a unique extension 
g to a strongly analytic function. Indeed, if we know the values at the opposite 
vertices of a square (say at z and z+7+1), then g(z+1) and g(z+7) are determined 
by the directions of the corresponding edges. Hence we can define g recursively 
farther and farther away from the diagonal. Example [7.1] above is the extension of 
the function g(a+ai) = t**2. 

Strongly analytic functions form a linear space over R, but not over C. In fact, 
they generate the linear space of all discrete analytic functions over C (Exercise 
7.9). Many interesting properties of such functions are proved in [Smirnov 20105], 
eee teres and [Chelkak-Smirnov 2012]. See Exercises and [7.10] 


for further important properties of strongly analytic functions. 


7.2. DISCRETE ANALYTIC FUNCTIONS ON PLANAR GRAPHS 97 


7.2. Discrete analytic functions on planar graphs 


We want to define analytic functions not only on the grid, but on more general 
planar maps. Definition does not generalize (unless all countries are quadri- 
laterals, a case to which we will come back), but (7.6) does suggest a reasonable 
generalization. 

Let G be a planar map and let G* be its dual, with a reference orientation. We 
call a pair of functions f: V—- Randg: V* > R an analytic pair, if the discrete 
Cauchy—Riemann equation is satisfied for every edge: 


(7.7) g(Is(e)) —g(rs(e)) = f (hd(e)) — f (tI(e)). 


Relation implies a number of further properties; for example, it follows that 
f is harmonic. Indeed, let wv1,...,uvg be the edges incident with u € V, in 
counterclockwise order, and let p; be the country in the corner of uv; and wvj+1 
(we assume, for simplicity of presentation, that the edges are all oriented away from 
u). Then 


k k 
(7.8) >= (Fe) F(@) = Y2 (905) — 9(pj-1)) = 0. 


j=1 j=1 


But this is bad news, since we have seen that no nonconstant function can 
be harmonic at all nodes of a finite graph. We can get around this obstacle in 
two ways: either we allow “singularities” at some locations, or we consider infinite 
graphs. The first approach is quite easy if we disregard the edges of the unbounded 
country (see Exercise [7.11). We are going to focus on the second. 

To this end, we have to redefine (for this section only) what we mean by planar 
maps. A planar map means an (infinite) graph G that is embedded in the plane 
so that every country is an open topological disk whose closure is compact, and 
every bounded piece of the plane meets a finite number of edges and nodes only. In 
particular, this implies that V and F are countable, every node has finite degree, 
and every country has a finite number of edges. Such a map has a dual G* = 
(V*, E*), which can be represented as another planar map. The lozenge graph G° 
and other simple manipulations on planar graphs can be defined in a straightforward 
way. 

In this more general setting, the definition can be required for every edge 
without forcing the function to be trivial; recall the examples from Section [7.1] 

Note that for an analytic pair (f,g), the function f determines g up to an 
additive constant; furthermore, for such a g to exist it is necessary and sufficient 
that f is harmonic. Indeed, defines a function on the edges of G*. The 
computation in (7-8), followed backwards, tells us that this function adds up to 
zero around each country of G*, and so the “potential argument” gives us the 
existence of g. 

In this sense, the theory of discrete analytic functions on planar maps is just 
another formulation of the theory of harmonic functions on such maps. However, 
the existence of the function on the dual map, which is harmonic as well, leads to 
a richer theory. 

It is often convenient to think of the two functions f and g above as a sin- 
gle function f defined on VUV* = V®, and allow complex values. We say that 
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f: V° > Cis analytic, if 


(7.9) f (Is(e)) — f (rs(e)) = 2(f(hd(e)) — F(tl(e))) 


for every edge e € F. As we noted for the grid before, the complex version is not 
substantially more general than the real. Indeed, (7.9) only relates the real part 
of fly to the imaginary part of f|y» and vice versa. In other words, a complex 
analytic function f can be decomposed as the sum of two very special analytic 
functions f; and fo: here f; is real on V and imaginary on V%*, and f2 is the other 
way around. 


Example 7.2 (Rubber bands). Let G be a toroidal map. We consider the 
torus as R?/Z?, endowed with the metric coming from the Euclidean metric on R?. 
Let us replace each edge by a rubber band, and let the system find its equilibrium. 
Topology prevents the map from collapsing to a single point: every non-contractible 
cycle remains non-contractible. In mathematical terms, we are minimizing the 
energy )> <p f(e)*, where the length ¢(e) of the edge e is measured in the given 
metric, and we are minimizing over all continuous mappings of the graph into the 
torus homotopic to the original embedding. 

It is not hard to see that the minimum is attained. Clearly, the edges are 
mapped onto geodesic curves (which become straight line segments if we “roll out” 
the torus). A nontrivial fact is that if G is a simple 3-connected toroidal map, then 
the rubber band mapping is an embedding. This can be proved along the lines of 
the proof of T heorem [3.2] and it also follows from Theorem [7.4] below. 

Rolling out the torus, we get a doubly periodic embedding of an infinite graph 
G in the plane. From the minimality of the energy it follows that every node is 
at the center of gravity of its neighbors. So considering the points of the plane as 


complex numbers, the embedding defines a function on V(G) that is harmonic at 
every node, and hence it is one component of an analytic pair on G (Figure 7.5). 4 


FIGURE 7.5. Periodic rubber band maps obtained from toroidal graphs 


It is rather straightforward to extend these notions to planar maps with 
weighted edges. We weight the edges of G by w € RE; this defines a weighting 
of the edges of G* by w* = 1/we-. We call a function f : V° — C analytic for 
the edge-weighted graph G, if the weighted version of the discrete Cauchy-Riemann 
equation 


(7.10) f(hd(e)) — f (tI(e)) = twe (F(rs(e)) — f(Is(e))) 
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is satisfied for every edge e € E. Again, we can consider instead two real-valued 
functions R(f) : V —> R and 3(f) : V* > R, and (completely independently) 
other two, namely 3(f): V —+ Rand R(f): V* > R. Both pairs satisfy the real 
version of the Cauchy—Riemann equations. 


7.2.1. Integration and critical functions. Let f and g be two analytic 
functions on the nodes of the lozenge graph G® of a weighted graph in the plane. 
We define integration along a path P = (vo, v1,..., Uz) in G®: 


k-1 
[fag PPO (Glu j45)—9(0))) 
P j=0 


This can also be written as 
k-1 


2 [ Fag = F(v0)g(v0) +> 
J 


j=0 


fv; 


a(v)) g(vj41)| F(vn)g(ve)- 


The nice fact about this integral is that it depends on the endpoints of the path 
only. By the usual argument, it suffices to verify that this integral is zero when 
P the boundary of a country of G°, which is a straightforward computation. It 
follows that we can write sib f dg. It is also easy to check the rule of integration by 
parts: for u,v € V°, 


(7.11) [ f49= foalv)- Fwalu)- f gat. 

These have been the good news. Now the bad news: for two analytic functions 
f and g, and a fixed starting node u, the function F(v) = ‘i f dg is not analytic 
in general. A simple computation gives the “analyticity defect” of an edge wv and 
dual edge e = pq: 


(7.12) 

F(v) — F(u) —ituy(F(p) — F(q)) = i(f(~) — £(w)) [9(v) + 9(u) — 9(@) —9(0)]- 
Assuming that the function g is such that the factor in brackets in (7.12) is zero 
for every edge: 


(7.13) g(Is(e)) +9(rs(e)) = g(hd(e)) +9(tl(e)), 

the integral function F' will be analytic. We call such a function g critical. What 
we have found above is that iM f dg is an analytic function of v for every analytic 
function f if and only if g is a critical analytic function. 

This notion was introduced, in a somewhat different setting, in 
under the name of rhombic lattice. Critical maps were defined in [Mercat 2001): 
these are maps which admit a critical analytic function. 

Recall condition on g being an analytic function: 


g(hd(e)) —g(tl(e)) = iwe (g(rs(e)) — g(Is(e))). 
These equations, together with (7.13), have a nice geometric meaning: the diagonals 
of the quadrilateral g(Is(e)) g(rs(e)) g(hd(e))g(tl(e)) have the same midpoint, and 
they are orthogonal. In other words, this quadrilateral is a rhombus. (In the 
unweighted case it is a square, so in this case, we are back to the study of analytic 
functions on the square grid!) 
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Consider the critical analytic function g as a mapping of GUG™* into the com- 
plex plane C. This defines embeddings of G, G* and G® in the plane with following 
(equivalent) properties: 

(a) The countries of G° are rhombi. 

(b) Every edge of G° has the same length. 

(c) Every country of G is inscribed in a circle with the same radius. 

(d) Every country of G* is inscribed in a circle with the same radius. 

If you want to construct such maps, you can start with a planar map straight- 
line embedded in the plane so that all countries are rhombi; this is bipartite, and 
you can define a critical graph on its white nodes by adding he appropriate diagonal 
to each rhombus and deleting the original edges (see Figure 7.6). 


FIGURE 7.6. A map with rhombic countries, and the critical graph 
constructed from it. 


Critical maps have a nice interpretation in terms of the Delaunay diagram: 
it means that all Delaunay circles of the node set V have the same radius. This 
is clearly invariant under duality. It follows that the Voronoi graph of V is the 
Delaunay graph of the dual (Figure [7.7| left). 


FIGURE 7.7. Left: the Delaunay circles of the node set of a critical 
map. Right: A track in a rhombic graph. 


Whether or not a weighted map in the plane admits a critical analytic function 
depends on the weighting of the edges. Which maps can be weighted this way? 
answered this question. Consider any country Fp of the 
lozenge graph GY, and a country F, incident with it. This is a quadrilateral, so there 
is a well-defined country F2 so that Fo and F> are attached to F, along opposite 
edges. Repeating this, we get a sequence of countries (Fo, F,, F)...). Using the 
country attached to Fo on the opposite side to F, we can extend this to a two-way 
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infinite sequence (..., F_1, Fo, Fi,...). We call such a sequence a track (see Figure 
right). 


Theorem 7.3. A planar map has a rhombic embedding in the plane if and only 
if every track consists of different countries and any two tracks have at most one 
country in common. 


7.2.2. Embedding by analytic functions. Restricting a complex analytic 
function f on G° (where G is an infinite planar map) to V, we get. a mapping of its 
nodes into the complex plane. We can extend this map to a drawing of the whole 
graph by mapping each edge wv onto the segment connecting f(u) and f(v). It 
turns out that under rather general conditions, this mapping is an embedding in 
the plane. 


Theorem 7.4. Let G be a 3-connected map in the plane and let f be a complex 
analytic function on G. Suppose that there exist a constant c such that 


on 1 @)-FOl 


ec dg(u,v) 


for every pair of distinct nodes u,v € V°. Then f defines an embedding of G in the 
plane such that every country is a convex polygon, and every node is in the center 
of gravity of its neighbors. 


Proof. Analogously to the proof of Theorem|3.2| the main step in the proof is 
the following. 


Claim 1. For every (open or closed) halfplane H, the sete S={ieV: fli) € H} 
induces a connected subgraph. 


We may assume that H is the halfplane {y > 0}. Let u and v be two nodes 
in S, we want to show that they can be connected by a path whose image stays in 
the halfplane H. We use induction on dg(u,v). Consider a shortest path P in G 
connecting u and v. We may assume that P is not just an edge, and that all the 
inner nodes of this path are outside H. Let w be a node on P which is lowest. 

We claim that there exists an infinite path P, = (uo = u,w1,...) starting at 
u such that f(P,,) lies in the upper halfplane and the distance of f(u,) from the 
x-axis tends to infinity. Let G’ be the connected component of G[S] containing w. 
It suffices to show that the distance of points of f(G’) from the z-axis is unbounded, 
since then the existence of P,, follows by simple compactness. 

Suppose that f(G’) lies in a strip 0 < y < yo, and let U denote the connected 
component of C\ f(G’) containing the halfplane y > yo. The boundary of U is a 
polygon, whose vertices are points f(i), i © V and intersection points of images 
of edges. Neither type of vertices can give a concave angle, and so U is convex; 
since U contains a halfplane, it follows that U is a halfplane. But its boundary 
cannot contain a line (in fact, it cannot contain any segment longer than c), which 
is clearly impossible. This proves that P,, exists as claimed. 

Similarly, we can find an infinite path P, = (vo = v, v1,...) such that f(P,) lies 
in the upper halfplane and the distance of f(v,) from the z-axis tends to infinity, 
and an infinite path P, = (wo = w,wi,...) such that f(P,) lies in the lower 
halfplane and the distance of f(w,) from the z-axis tends to infinity. Clearly P,, is 
node-disjoint from P, and P,, and we may assume that its only intersection node 
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with P is w. If P, and P, intersect, then the conclusion is trivial, so assume that 
they are node-disjoint. 

Consider PU P,,U Py U Py in the original planar embedding of G. This subgraph 
splits the plane into three infinite regions; let Q be the region bounded by P,,U 
PUP,. It is easy to see that there are infinitely many disjoint paths Q),Q2,... 
connecting P, and P, inside Q. We claim that if & is large enough, the image of 
Q;, must stay in H, proving that u and v can be connected by a path in S. 

Let D denote the diameter of f(P). Let u; be the last node on the path P,, such 
that the distance of f(u1) from the x-axis is at most D+ 2c?, let P/ be the piece 
of P, between u and uy, and let P’’ = P,,\ P’. We define P’!, P’, etc. analogously. 

Suppose that f(P’’) intersects f(P!’), say edge ij of f(P’’) intersects edge 
ab of f(Pi). By (7.14), the length of the image of any edge is at most c, so 
| f (i) — f(a)| < 2c, and so dg(i,a) < 2c?. Thus there exists a path R of length at 
most 2c? in G connecting i to a. Again by (7.14), the diameter of f(R) is at most 
2c?. By the definition of P”, the distance of f(i) from the x-axis is more than 2c?, 
so f(R) cannot cross the x-axis. It follows that u and v can be connected by a path 
whose image stays in the upper halfplane, using paths P,, R, and Py. 

So we may assume that f(P’’) and f(P.’) are disjoint. Let T be the set of all 
nodes in G at a graph-distance at most cz from PUP! UP! UP*,. Since T is a finite 
set, there is a k for which Q, does not intersect T. By we get that for every 
node s of Q, and every node t of P, 


ls)— f(b] = ED > 9 Lo, 
c c 
In particular, f(s) cannot be in the convex hull of f(P). 

If f(Qx) does not intersect the lower halfplane, then we are done. Suppose 
it does, then either it intersects f(P’,) or else it contains a subpath Qj, such that 
f(Qj,) lies in the upper halfplane and intersects both f(P/’) and f(P,’). 

Suppose that f(Q,) intersects f(P.). Similarly as above, we find a path R of 
length at most 2c? in G connecting a node a on P,, to a node i on Q x. This path 
must intersect the path PU P,,U P, at some node z; this means that dg(z, a) < 2c’, 
and so | f(z) — f(a)| < 2c?. But f(z) is either in the upper halfplane or at a distance 
at most D from it, and so f(a) is at a distance at most 2c?+D from the upper 
halfplane. So a € V(P",) and hence i € T, a contradiction, since Q;, avoids T. 

Finally, if there is a path Q such that f(Q) lies in the upper halfplane and 
intersects both f(P/’) and f(P”’), then similarly as above, we find two paths R,, 
and R, of length at most 2c? connecting Q to P” and P”’, respectively. Similarly 
as above, these paths must stay in the upper halfplane, and so again we find that u 
and v can be connected by a path staying in the upper halfplane through P,,, Ry, 
Q, R, and P,. This completes the proof of the Claim. 


The assertion that every node is in the center of gravity of its neighbors is just 
a restatement of the fact that every analytic function is harmonic. This also shows 
that (assuming that f gives an embedding) no country can have a concave angle, 
and so the countries are convex polygons. So the main step is to show that f defines 
an embedding. 

We start with observing that the image of every edge of G is a segment of 
length at most c, and for any two nodes u and v of G (adjacent or not) the distance 
of f(u) and f(v) is at least 1/c. 
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Now we turn to the proof that f defines an embedding. Let us triangulate each 
country of G arbitrarily, to get a new graph G,. Let us draw the images of these 
new edges as straight segments. We claim that even with these new edges, f defines 
an embedding of G1. 

It is enough to show that (a) two triangular countries of G; sharing an edge 
xy are mapped onto triangles on different sides of the line f(x) f(y), and (b) the 
images of triangular countries incident with the same node x cover a neighborhood 
of f(x) exactly once. We describe the proof of (a); the proof of (b) is similar. 

So suppose that xyz and xyw are two triangular countries of G;, and that f(z) 
and f(w) are on the same side of the line L through f(x) and f(y), say on the 
right side. By the Lemma, there is a path P in G connecting z and w whose image 
under f stays on the right side of L. Since x is mapped to the center of gravity of 
its neighbors, there is a node x’ adjacent to x in G such that f(z’) lies on the left 
side of L, and similarly, y has a neighbor y’ such that f(y’) lies on the left side of 
L. Again by the Lemma, there is a path Q in G connecting x’ to y’ such that the 
image of Q stays on the left side of L. Extend Q with the edges xa’ and yy’ to get 
a path Q’. 

Now obviously P and Q’ are node-disjoint paths. But if we consider them 
in the planar embedding of G, it is clear that they must cross each other. This 
contradiction proves the theorem. 


7.2.3. Geometric representations and discrete analytic functions. 
Harmonic functions are closely related to geometric representations, as we have 
seen, among others, in rubber band representations. In the setting of analytic 
functions, this connection is even richer. 


Square tiling. We have described the connection between square tilings and har- 
monic functions on planar graphs, as discovered by [Brooks et_al. 1940], in Section 
Here we consider the closely related structure of tiling the whole plane with 
squares, whose sides are parallel to the coordinate axes. Assume that the tiling is 
discrete, i.e., every bounded region contains only a finite number of squares. 

Points where four squares meet must be “resolved” just as in the finite case. 
Every such point will be either “vertical-through” or “horizontal-through”. Let us 
also call a vertex of a square that is an interior point of a vertical edge of another 
square a “vertical-through”. Also assume that there is no horizontal semiline com- 
posed of edges of the squares that contains no “vertical-through” point. We call a 
horizontal segment basic, if it is composed of edges of squares and its endpoints are 
“vertical-through” points. 

We associate a map in the plane with this tiling just as in Section [6.1] We 
represent each basic segment by a single node (say, positioned at the midpoint of 
the segment). Each square “connects” two basic segments, and we can represent it 
by an edge connecting the two corresponding nodes, directed top-down. We get a 
directed graph G, which is infinite but locally finite. 

It is not hard to see that G is planar. If we assign to each node (basic segment) 
its y coordinate, we get a harmonic function; from this, we get an analytic pair as 
described above. One can give an explicit description of the other function in this 
pair by repeating the same construction using the vertical edges of the squares. It 
is not hard to see that this gives the dual graph, and the x coordinates supply the 
second function of an analytic pair. 
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This construction can be reversed. Starting with a planar map G and an 
analytic pair on it, we can construct a tiling of the plane by squares, where the 
squares correspond to the edges of G (or, equivalently, to the edges of G*, or, 
equivalently, to the countries of G°). 

The following special case is worth mentioning. Let G be a 3-connected simple 
toroidal map, and let us construct its rubber band embedding as in [7.2] We can 
lift this to a periodic embedding of the universal cover map Gin C, where the node 
positions define a harmonic function. We can construct a square tiling representing 
this graph as above. This too will be periodic, so the universal covering mapping 
maps it onto a square tiling of the torus. 


Touching polygons. The ideas in give another geometric interpre- 
tation of analytic functions. 

We need to define one further graph derived from a planar map G: define the 
total map of a planar map as the medial map of its lozenge map. To clarify the 
name: the lozenge map has a node for each node and for each country of G, and a 
country for each edge of G. So its medial graph has a country for each node, edge 
and country of G. 

Suppose that a function f: VUV* — C gives a straight line embedding of G in 
the plane with convex countries, and a straight line embedding of G* with convex 
countries (we consider the infinite case; in the finite case, we should ignore the 
unbounded country). The only condition we need connecting the two embeddings 
is that if uv is an edge of G (with this orientation), and pq is the corresponding 
edge of G* (so that p represents the country Is(uv)), then the line f(p)f(q) crosses 
the line f(u)f(v) from left to right. Let P,, denote the convex polygon representing 
the country of G (or G*) corresponding to u € V* (or u € V)). Shrink each P,, 
from the point f(w) by a factor of 2, to get a polygon P’. (The point f(u) is not 
necessarily in the interior of P,,.) This way we get a system of convex polygons, 
where for every edge uv € G°, the two polygons P, and P, share a vertex at the 
point (f(u)+f(v))/2 (Figure 7.8). 

There are three kinds of polygons in this picture: the dark and light grey ones 
correspond to the nodes in V and V%, respectively, while the white parallelograms 
Q. between the polygons correspond to the edges e € FE. It is easy to prove 
that these are indeed rectangles, and the two vectors spanning the parallelogram 
corresponding to edge e are (f(hd(e)) — f(tl(e))) and $(f(Is(e)) — f(rs(e))). It can 
be shown (using our hypothesis about the relationship between the two straight line 
embeddings) that the interiors of these polygons will be disjoint. Thus this method 
provides an embedding of the total graph of G in the plane. 

It is clear that the function f is analytic if and only if every parallelogram Qe 
is a square. If the parallelograms Q, are rectangles, then we can weight every edge 
e € E with the ratio of the sides of Q.. The function f will be analytic on this 
weighted graph. 

This correspondence with analytic functions is nice enough, but there is more. 
Let us take another analytic function g on this edge-weighted graph, and construct 
the polygons g(u)P’ (multiplication by the complex number g(u) corresponds to 
blowing up and rotating). The resulting polygons will not meet at the appropriate 
vertices any more, but let’s try to translate them so that they do (Figure[7.9). The 
computation is quite simple. Let h(u) be the vector with which we try to translate 
g(u)P!, where (say) u € V. Let p be a country of G incident with u. Then the 
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FIGURE 7.8. Left: the total graph of a planar map G, obtained 
from straight-line embeddings of the map and its dual (only the 
neighborhood of a single edge is shown). Right: similar representa- 
tion of the graph in Figure[7.5| Note that since the node positions 
of the original graph define an analytic function, the white regions 
representing the edges are squares. 


image of u in the image of P, is 


Fw) £10) + h(p), 


while the image of p in the image of P,, is 


These two points must be the same, which leads to the equation 


FO* LP) (op) — g(u)) = h(u) h(n). 


Recall that f and g are given, and we want to find a function A satisfying this 
equation for every edge up of G°. Functions h satisfying this condition are precisely 
of the form 


no) = f f dg +const 


(with an arbitrary but fixed starting node u). We have seen that this integral is 
well defined for any two analytic functions. If g is, in addition, critical, then this 
function fA is a third analytic function defined by this representation of the total 
graph. 

It is not hard to see that conversely, every “deformation” of the total graph 
picture such that the polygons P, remain similar to themselves defines an analytic 
function on G. 


Circle packings. We can also relate discrete analytic functions to circle repre- 
sentations. Let G be a 3-connected toroidal map. It is again best to go to the 
universal cover map G. Then we can construct two (infinite, but discrete) families 
F and ¥* of circles in the plane so that they are double periodic modulo a lattice 
L=Za+Zb, F (mod L) corresponds to the nodes of G, F* (mod L) corresponds 
to the countries of G, and for every edge e, there are two circles C, C’ representing 
hd(e) and tl(e), and two circles D and D’ representing rs(e) and Is(e) so that C,C"’ 
are tangent at a point p, D,D’ are tangent at the same point p, and C,D are 
orthogonal. (This can be proved along the lines of the proof of Theorem |5.3}) 
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FIGURE 7.9. The deformation of the touching polygon represen- 
tation given by an additional analytic function. 


If we consider the centers of the circles in F as nodes, and connect two centers 
by a straight line segment if the circles touch each other, then we get a straight 
line embedding of the universal cover map in the plane (periodic modulo L). Let 
Xy denote the point representing node v of the universal cover map or of its dual. 

To get a discrete analytic function out of this representation, consider the plane 
as the complex plane, and define f(u) = x, for every node u € VUV*. By the 
orthogonality property of the circle representation, 


f(hd(e)) — f(tI(e)) = iw(e)(F(rs(e)) — F(Is(e))) 
with an appropriate w(e) > 0. It follows that if we consider the map G with weights 
w(e), then f is a discrete analytic function on this weighted map. 


For more on the subject, see the book |Stephenson 2005] and the papers 
Mercat_2002], |Mercat 2004}. 


7.3. Discrete holomorphic forms, a.k.a. rotation-free circulations 


Let us return to finite maps, but this time on surfaces other than the plane. 
In this section, it will be convenient to allow the graphs to have parallel edges 
and loops. While no nonconstant analytic function can be defined on any finite 
graph, we can do a lot better if we consider functions on the edges, analogous to 
derivatives of “true” analytic functions, namely holomorphic forms. We have to 
start with some topological considerations concerning maps on surfaces. 


7.3.1. Maps, boundaries and coboundaries. Most of the basic definitions 
concerning planar maps carry over without any difficulty to graphs embedded in 
other surfaces. A point to pay attention to is the definition of a proper map on a 
compact surface S. By this we mean a finite graph G embedded in S so that each 
country is homeomorphic to an open disk. Once we have this property, the notions 
of the cyclic sequence of edges forming the boundary, the dual map, the lozenge 
map, etc. can be defined. If the surface is orientable, then we can define for each 
edge a right shore rs(e) and a left shore Is(e) just like in the plane. For a proper 
map on a compact surface, Euler’s Formula generalizes as 


where x is the Euler characteristic of the surface. For a compact orientable surface 
of genus g (a sphere with g handles), one has x = 2—2g. 
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Let G be a finite directed map on an orientable surface. For every face F' € V*, 
we denote by Or € R” the boundary of F: 


1, if ls(e) =F, 
(Or)le= 4-1, ifrs(e) =F, 
0, otherwise. 


Then |0F|? is the length of the cycle bounding F. Note that OF is just the spe- 
cial circulation fo defined in (B.2), where C' is the boundary cycle of F' with the 
counterclockwise orientation. 

This construction is dual to vector 6, and the 0-dimensional coboundary oper- 
ator, introduced in Appendix [B.3| (those can be defined for every graph, not only 
for maps on surfaces). Similarly as there, we can consider the boundary operator 
as an Fx V* matrix D = Dg, where Dlr = Of for every country F € V*. Its 
transpose D' € RY" ** is the coboundary operator from dimension 1 to dimension 
2, with the rather simple action of D'l, = Lise) — lrs(e)- It is easy to check the 
fundamental relations 


(7.16) B'D=0 and D'B=0. 


(In topology, one would denote both coboundary operators B and D' by 6 and 
both boundary operators B' and D by 0, so could be written as 6? = 0 and 
6? = 0. In the simple cases we need, it will be more convenient to distinguish the 
boundary and coboundary maps in different dimensions.) 

Some basic facts about various subspaces defined by the coboundary operator 
B are summarized in Appendix [B.3] If the graph is embedded in a surface, then 
we get a little more interesting spaces, relating the graph with its dual. (This still 
belongs to the almost-trivial part of algebraic topology.) 

As a warmup, consider a (finite) 2-connected planar map G. Then the space 
of circulations H and its orthogonal complement A, the space of potentials, are 
related by map duality. Mapping each edge to the corresponding dual edge defines 
an isomorphism ¢: R” + R®’. It is not hard to verify that ¢(Ag) = Hg and 
(Hg) = Ag+. Boundary cycles of bounded countries of G form a basis of Hg. 
So dim(Ag«) = m—n+1; recalling that dim(A) = n—1, we see that this is just a 
restatement of Euler’s formula. 

We get a richer structure if we assume that G is embedded in an orientable 
surface of higher genus. Let © be a closed compact orientable surface of genus g, 
and suppose that the directed graph G is embedded in © as a map. Let f = |V*| 
be the number of countries. We say that a vector 2 € R® is rotation-free, if for 
every country p the equation 


(7.17) aS Se 

e:ls(e)=p e:rs(e)=p 
holds. We can express this as x | 0,. This is also equivalent to saying that the 
vector x defines a circulation on the dual graph. 

Each vector Op (p € V*) is a circulation; circulations that are linear combina- 
tions of these special circulations are called null-homologous. Two circulations z and 
z' are called homologous if z—z’ is null-homologous. Null-homologous circulations 
form a linear space B = Bg C R® generated by the vectors Op. 

We have defined two orthogonal linear subspaces of R”: the space A of po- 
tentials and the space B of null-homologous circulations. Interchanging the roles 
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of the map and its dual, the roles of A and B are interchanged. The orthogonal 
complement At = H is the space of all circulations, and B+ is (under the identifi- 
cation of F and E*) the space of circulations on the dual graph. The intersection 
C = AtnB+t = (A+B)* is the space of rotation-free circulations. This subspace 
will be our main object of study in this section. 

The orthogonal decomposition 


(7.18) R® = A@BOC 

can be thought of as a discrete version of the Hodge decomposition from differ- 
ential geometry. We know that dim(A) = n—1 and dim(B) = dim(Ac«) = f—-1. 
Counting dimensions, we can express the dimension of C by the genus of the surface: 
(7.19) dim(C) = m—dim(A) —dim(B) = m—(n—1)-—(f-1) = 2g. 

In particular, we see that for maps on the sphere, the space C is the null space: no 
nonzero circulation can be rotation-free. 

Example 7.5 (Toroidal grid). Figure shows a rotation-free circulation on 
the toroidal grid. The numbers 5 and 17 can be replaced by two arbitrary numbers, 


so this picture gives a 2-dimensional space of rotation-free circulations. By (7.19), 
the dimension of C is 2, so there are no other rotation-free circulations. 


4 A 4 A rN A 


5 5 5 5 5 5 

17 17 17 17 17 17 
is % * Fr 7% a > 
5 5 5 5 5 5 

17 17 | 17/7) 17 17 17, 
5 5 5 5 5 5 

17 17} 17 | 17 ~ 7 #17 d72.,. 
5 5 5 5 5 5 

17 A LL 17. 
5 5 5 5 

17 17 17 7 /° 17 : 17 
ic i <i oe he * > 


FIGURE 7.10. Rotation-free circulation on the toroidal grid. 
(Edges leaving on the top are identified with edges entering on 
the bottom, and similarly for right-left, so that we have a graph 
embedded in the torus.) 


7.3.2. Rotation-free circulations and discrete analytic functions. Let 
z € R® bea rotation-free circulation on a graph G embedded in a surface. Consider 
a family F’ of countries whose union is homeomorphic to the disk, and let G’ = 
(V’, E’) be the graph formed by the boundary edges of these countries. Every 
cycle in G’ is null-homologous, and so rotation-freeness and the potential argument 
(Corollary |4.5) give us a function f : V’ + R such that z = f(hd(e)) — f(tl(e)) 
for every edge e € E’. Similarly, the flow condition implies that there is a function 
g: F' — Rsuch that z. = g(rs(e)) —g(Is(e)) for every edge e between two countries 
in F’. (To see the analogy with complex function theory better, we shifted our 
notation to functions on node sets rather than vectors indexed by nodes.) 
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The functions f and g defined above satisfy 


(7.20) f(tl(e)) — f (hd(e)) = g(rs(e)) —g(Is(e)) 


for every edge between two countries in F. We can think of f and g as the real and 
imaginary parts of a (discrete) analytic function, and can be thought of as a 
discrete analogue of the Cauchy—Riemann equations. 

It is important to emphasize that these functions f and g exist only locally, 
in the planar piece of %; but this is not surprising, since “true” analytic functions 
may have well-defined values only when restricted to a simply connected region. 
However, we can use the trick of Riemann surfaces from complex function theory. 
For a map G on a closed orientable surface © different from the sphere, we consider 
the universal cover mapping ¢: R? — X. Rolling up the plane twice to get a torus 
is an example. 

The graph G = 671(G) = (V,E,F) is called the universal cover of G. This 
is an infinite graph embedded in the plane, and it is a planar map in the sense 
of Section Furthermore, it is invariant under the action of an appropriate 
discrete group of isometries of the Euclidean plane (in the case of the torus) or of 
the hyperbolic plane (in the case of higher genus). The universal cover of the dual 
map G* is the dual map G*. 

Any rotation-free circulation on G corresponds to a rotation-free circulation on 
G. This circulation can be represented as 4. = f (hd(e)) — f(tl(e)), where f : V > 
R. Note that the function f can not be obtained by lifting a function on V. 
Similarly, we obtain a function g: V(G*) > R such that a. = g(rs(e)) —g(Is(e)). 
The pair (f,g) is an analytic pair. 


FIGURE 7.11. Left: A rotation-free circulation on the torus (upper 
left corner) and a corresponding harmonic function on the universal 
cover map. Right: A corresponding harmonic function on the dual. 


Figure[7.11|shows a rotation-free circulation on a graph embedded in the torus. 
The first figure shows an associated harmonic function on the nodes of the universal 
cover map, the second, a harmonic function on the dual map. 

We can introduce these basic notions in a little larger generality: we associate 
a positive weight w(e) with every edge. In most of this chapter, we restrict our 
discussion to the unweighted case, because extending this to the weighted case 
would not amount to much more than inserting ‘w(e)’ at appropriate places in the 
equations. 


110 7. DISCRETE ANALYTIC FUNCTIONS 


7.3.3. Nondegeneracy properties. We state and prove two key nondegen- 
eracy properties of rotation-free circulations. We start with a simple lemma about 
maps on surfaces. For every country p, let rev(p) denote the number of times the 
orientation of the edges changes if we move along the boundary of p. For every 
node v, let rev(v) denote the number of times the orientation of the edges changes 
in their cyclic order as they emanate from v. 


Lemma 7.6. Let G = (V,E,V*) be a proper directed map on a surface of Euler 
characteristic x. Then 
S> (rev(u) —2) = —2x. 
ueVUV* 
Proof. Every corner means an orientation change either for the node or for 
the country defining it. Hence 
S- rev(u) = S- deg(v) = 2m = 2n+2f —2y. 
ueVUV* veV 
Rearranging, we get the equality in the lemma. 


The first nondegeneracy property |Benjamini—Lovasz 2003] concerns certain 


special rotation-free circulations, namely those obtained as projections of basis vec- 
tors. Let UV denote the orthogonal projection of R” to the space C of rotation-free 
circulations. 


Theorem 7.7. Let G be a 3-connected proper map on an orientable surface with 
genus g >0. Then V1. £0 fore € E. 


Note that U1. 4 0 implies that 
(7.21) (U1.). = 1)W1, = 1/071, = |W1,|? > 0, 
so a (seemingly stronger) conclusion of the theorem could be that the vector V1, 
is positive on the edge e. In a similar fashion, for any c € C, we have 
Ge = We = 1)(Ge) = c'(F1,), 


so it would suffice to prove that not every rotation-free circulation vanishes on e. 

For g = 0 there is no nonzero rotation-free circulation, and hence V1, = 0 for 
every edge e. So the hypothesis that g > 0 cannot be omitted. The assumption that 
G is 3-connected cannot be dropped either, as shown by the example in Figure[7.12] 
Here every rotation-free circulation is 0 on e, and hence V1, = 0. The orientability 
of the surface is also important (Exercise [7.17). 


FIGURE 7.12. Every rotation-free circulation is 0 on the edge e. 
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Proof. Suppose that V1, = 0 for some edge e. This means that 1. € A+B, 
and so we have a decomposition 1, = a+b, where a € A and b € B. So we have 
ar+by =0 for every edge f Ae, but af+by =1 for f =e. By the definition of A 
and B, there are vectors x € RY and y € RY” such that a = Bx and b = D'y. We 
may choose the reference orientation so that a > 0. 

Let S = {yp: p © V*}. For c € S, let U. denote the union of countries p 
with yp > c. The boundary of U, is an eulerian subgraph D, of G. Let D be a 
cycle contained in D,. For every edge f € E(D)\{e} we have ay > 0, and all these 
edges are oriented in the same way around the cycle. So 2; strictly increases as we 
traverse the cycle D. This can only happen if e € E(D), and since this holds for 
every cycle in D C D,, the whole subgraph D, is a single cycle through e. It also 
follows that all the values x; on D, are different, and they grow strictly from one 
endpoint of e to the other. 

Let Go denote the subgraph formed by those edges f for which ay = 0, and Gy; 
the subgraph formed by the other edges. Since x; is constant on every connected 
component of Go, any cycle D, meets any component of Go at most once. 

Let a < b be two consecutive numbers in S, then the boundary of U,\Usz is 
contained in D,UD»,. A connected component of Gp can have at most two nodes 
in common with D,U D,; hence, by 3-connectivity, this connected component must 
consist of a single edge that is contained in U, \U»,, connecting D, and Dp. 


FIGURE 7.13. Level sets of the vector y, and the neighborhood of 
a node v 


This argument implies, in particular, that every node is contained in at least 
one cycle D,. Let v 4 hd(e), tl(e) be any node, and let a € S and b € S be smallest 
and largest values for which v € V(D,) and v € Dy (Figure (7.13). Let c < a and 
d > bbe a nearest numbers in S' (we assume that these numbers exist; the opposite 
case can be settled similarly). The edges of the cycles D, (a < r < 6) entering v 
are consecutive in the cyclic orientation about v, and so are their edges leaving v. 
Furthermore, Go can have at most two edges adjacent to v, one connecting v to 
D-, and one to Dg. No matter how these two edges are oriented, the edges incident 
with v form two consecutive sequences, one entering and one leaving v. In other 
words, 


(7.22) wei 0, ifv= Hl) or v = hd(e), 
2, otherwise. 


A similar argument shows that rev(p) = 2 for every country p. Substituting in 
Lemma [7.6] yields —4 = —2y, so x = 2, and thus the surface is the sphere, a 
contradiction. 
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Considering a “generic” linear combination of the vectors V1. (e € FE), we 
obtain the following corollary. 


Corollary 7.8. Every 3-connected proper map on an orientable surface with genus 
g > 0 carries a rotation-free circulation that does not vanish on any edge. 


The arguments above also yield a more explicit formula for V1.¢. 


Proposition 7.9. Let G be an oriented map on an orientable surface 4, and let 
e€ E. Let x € RY be the vector of voltages of the nodes, if a unit current is sent 
from tl(e) to hd(e), and let y€ RY” be defined analogously for G*. Then 


Wl, =1.—Bar—Dy. 
Proof. We have the decomposition 1. =a+b+cwithae A,be€ BandceC. 
Using the definitions of A, B and C, this can be written as 
1. = Br+Dy+V1. 


for appropriate vectors « € RY and y € RY’. Applying the boundary operator BT’, 
we get 

Tha(e) = Luce) = B'Bza = Lx, 
since Dy and W1, are circulations and hence B'Dy = B'W1, = 0. As noted at the 
end of Section [4.1.3] we can interpret x (and similarly y) as in the Proposition. 


The effective resistance between the endpoints of e = i7 can be expressed as 
R(i,j) = 1! Bz, and similarly the effective resistance of the dual map between the 
endpoints of the dual edge e* = pq is R(p,q) = 1{ Dy. Taking scalar product of the 
formula in Proposition [7.9] with 1., we get 


Using (7.21), we get a corollary for effective resistances: 


Corollary 7.10. If G is map on an orientable surface, then for every edge e, 
R(i,j)+R(p,q) <1. If the surface is the sphere, then equality holds. If the surface 
is different from the sphere, and G is 3-connected, then strict inequality holds. 


The second main nondegeneracy property |Benjamini—Lovasz 2002) tries to cap- 


ture the property of “true” analytic functions that if they vanish on an open disk, 
then they vanish identically. What should correspond to the notion of “open disk” 
in a discrete setting? It is not hard to guess that it should mean a “large enough 
connected subgraph”, but just specifying the number of nodes will not work. 


Example 7.11. Let G be a map on an orientable surface with genus g > 0, and let 
X be a connected induced subgraph of G such that at most 2g “external” nodes of 
X are connected to the rest of G (otherwise, X can be arbitrarily large). Suppose 
that X is embedded in a subset of & of the surface that is topologically a disk. 
Delete the edges of X as well as all the non-external nodes, create a new node 2, 
and connect it to the external nodes (Figure[7.14). 

We get a graph G’ that is still a map on ©. Thus this graph has a 2g-dimensional 
space of circulations, and hence there is a nonzero rotation-free circulation ~ van- 
ishing on each of the 2g—1 of the edges incident with w. Since this is a circulation, 
it must vanish on all the edges incident with x. Delete x, put X back, and extend 
w with O0-s to the edges of X. We get a nonzero rotation-free circulation vanishing 
on X. 4 
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FIGURE 7.14. Contracting a loosely connected piece 


This example suggests that the right thing to consider is not the size of the 
zero-region but its node-boundary. The following theorem asserts that this is indeed 
so, and the number 2g in the example is best possible up to a factor of about 2. 


Theorem 7.12. Let G be a graph embedded in an orientable surface i as a map. 
Let yp be a nonzero rotation-free circulation on G and let G’ be the subgraph of G 
on which p does not vanish. Suppose that yp vanishes on all edges induced by, and 
incident with, a connected subgraph U of G. Then U can be separated from G" by 
at most 4g—3 nodes. 


Proof. In our topological arguments, we have to be careful not to use intuition 
based on planar maps. In particular, we may have edges both sides of which belong 
to the same country (even though the graph is 3-connected). We call such an edge 
non-bordering. 

Let W be the connected component of G\ V(G’) containing U, and let Y denote 
the set of nodes in V\V(W) adjacent to W. So Y C V(G’), every node in Y is 
adjacent to W, and all edges e between Y and W, as well as in G[W], have y(e) = 0. 

Consider an edge e with y(e) = 0. If e is not a loop, then it can be contracted 
to get a map on the same surface with a rotation-free circulation. In particular, we 
can contract W to a single node u (Figure[?.15). 


FIGURE 7.15. Contracting a connected component of zero edges 


If e is a loop with Is(e) ¥ rs(e), then G\e is still a map (the interior of every 
country is topologically an open disk), and ¢ is a rotation-free circulation on it. 

So we can eliminate an edge with y(e) = 0 unless it is a non-bordering loop. In 
this latter case, we can change y(e) to any nonzero value and still have a rotation- 
free circulation. 
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We apply this elimination procedure to all edges with y = 0 except for the 
edges between u and Y, and between two nodes of Y. We don’t try to contract 
edges between nodes in Y (we do not want to reduce the size of Y), but we can 
delete them unless they are non-bordering. If there are parallel edges between u 
and Y, and at least one of them is a border between distinct countries, then we 
delete this too. So at the end we may assume that every edge between u and Y is 
either single, or non-bordering. Let D be the number of edges between u and Y. 

Re-orient each edge with y ¥ 0 in the direction of the flow y, and orient the 
edges between u and Y alternatingly in an out from u (there may be one pair of 
consecutive edges in the same direction, if D is odd). Orient the edges with y = 0 
between two nodes of Y arbitrarily, to get a digraph G1. 

It is easy to check that G; has no sources or sinks, so rev(v) > 2 for every 
node v, and of course rev(u) > D—1> |Y|—1. Furthermore, every country either 
has an edge with y > 0 or a non-bordering edge on its boundary. In the first 
case, it cannot be bounded by a directed cycle, since y would add up to a positive 
number on its boundary. In the second, the country boundary goes through the 
non-bordering edge twice in different directions, so again it is not a directed cycle. 
Thus we have rev(p) > 2 for every country p. 

Substituting in Lemma|7.6] (with G, in place of G), we get that |Y|—1 < 4g—4. 
Since Y separates U from G’, this proves the theorem. 


7.4. Global information from local observation 


As an application of the nondegeneracy results above, we describe an algorith- 
mic problem; I don’t think this problem has any important applications, but it is 
perhaps interesting for its unusual setting. Suppose that we live on a (finite) map 
on a compact orientable surface with genus g (the embedding is reasonably dense). 
Can we determine global properties of the graph, say the genus of the surface, if 
we only have access to a small neighborhood of our location (Figure [7.16)? 


FIGURE 7.16. A map observed at the black nodes. Can you de- 
termine the genus of the surface? 


Well, clearly not. But suppose that the graph is “active”, in the sense that the 
edges carry weights, which change according to random local transitions, and we 
can observe the process in that small neighborhood for some time. Let us describe 
a particular process that allows us to determine the genus of the graph, based on 
[Benjamini-Lovasz 2003] 

Discrete holomorphic forms motivate a reasonably natural and simple proce- 
dure, which we call random balancing. Informally, this can be described as follows. 
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We consider a finite map G, with a reference orientation, on an orientable surface 
S. Each edge carries a real weight 7;;. With some frequency, a node wakes up, and 
balances the weights on the edges incident with it, so that locally the flow condition 
is restored. With the same frequency, a country wakes up, and balances the weights 
on the edges incident with it, so that the rotation around the country is canceled. 
To make this precise, we start with any vector 2 € R”. At each step, one of 
the following two operations is carried out on the current edge-weighting x € R®: 


(a) NoDE BALANCING. Let v € V and let s, = (6,)'x be the “imbalance” at 
node v (the value by which the flow condition at v is violated). We modify « by 
subtracting (s,/deg(v))6d, from it. 


(b) CouNTRY BALANCING. Let p € V*, and let rp = (Op) 'x be the “rotation” 
around p. We modify x by subtracting (ts / deg(p)) Op from it. 

Rotation-free circulations are invariant under node and country balancing. 
Starting with any vector x on the edges and doing country and node balancing 
of randomly chosen nodes and countries, from the uniform distribution on VUV%, 
the weights will converge to Ux quite fast (Exercise [7.18). 

How do we observe the procedure? Let U be a connected subgraph of G, which 
cannot be separated from any non-zero-homotopic cycle by fewer than 4g —1 nodes. 
Let Eo be the set of edges incident with U (including the edges of U). We think of 
Eo as the “window” through which the graph G can be observed. 

Let 'x denote the restriction of c € R® to Eo. By Theorem [7.12| dim(T'C) = 
dim(C) = 2g. For any x € R®, the vector z = Wz is in C, and T'z can be observed 
in the window. If we can repeat this with sufficiently many starting edgeweights 
x',...,2™ so that their projections z* = Uz’ span C, then T'z!,...,Tz™” span 
TC by Theorem and so determining the dimension of the space spanned by 
Tz!,...,0z™ tells us the genus of the surface. 

How to generate such starting vectors x‘? Since by our model we have access 
to the edges in Ey = {€1,...,@m}, it is natural to restart from x? = 1.,+2*~1. (For 
i = 1, we start from 1,,.) We call this a local excitation. 

Do local excitations yield sufficiently many linearly independent vectors I'z*? 
Luckily, they do, as shown by the next lemma. 


Lemma 7.13. The vectors U1, (e € Eo) generate C. 


Proof. Suppose not, then there is a nonzero vector v € C orthogonal to every 
vector V1,. Then 
v'ie= (vy) 1, =(Vil,)'v=0 
for every edge e € Eo, which means that 'v = 0. By Theorem|7.12} it follows that 
v = 0, a contradiction. 


The more difficult issue is: how to compute z = Va from x? Starting with 
ro = 2, let a, € R” be the vector of edgeweights after t random balancing steps. 
If z = Wao, then x; > z as t > oo, and sola, — Tz. 

Of course, after a local excitation we cannot wait for infinite time to let the 
system settle to a rotation-free circulation. We have to make an observation of the 
window Eo allowing sufficient but finite time T’ after each local excitation. These 
observations provide vectors y1,.--,%m € R” such that y;—yi_1 © PW, (here 
Yo = 0). If we had exact equality here, then the rank of (y1,...,Ym) would be 


exactly 2g, by Lemma and Theorem 
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But we have errors! We can do the following: we construct a linear subspace 
= (approximating ['C) as follows. Start with %) = 0. For i = 1,...,m, check 
if the distance of y;—y;-1 from %4;_, is at least a pre-defines « > 0. If so, let 
Y; = lin();_-1U{y;}), else let 4; = 4;_1. Return the dimension of ¥,,,. Let us call 
this procedure subspace fitting. 


Theorem 7.14. Let ¢ = (n+ fy- Ory”, T = (n+f)", and let k be the result of 
the subspace fitting algorithm. Then with high probability, g = k/2. 


The proof of this theorem involves some rather tedious computation, analogous 


to that in |Benjamini—Lovasz 2002], and will not be reproduced here. For the 
interested reader, we state the main steps as Exercises and 


Remark 7.15. We defined balancing steps as a node/country chosen uniformly 
from V UV*. While this is perhaps easier to describe, it has an unpleasant feature, 
in particular if we think of the map G as very large: random choice for the site 
of the balancing step is a global operation. The following (essentially equivalent) 
model is therefore more appealing: Let us endow each node and each country 
with a “Poisson clock”, emitting ticks at times according to a Poisson process with 
parameter 1. When its clock ticks, the site balances. Thus the model becomes truly 
local. Unfortunately, the waiting time in the algorithm still depends (polynomially) 
on the size of the graph; this is natural, since information from the distant parts of 
the map must get to the observation window. 


Remark 7.16. In a different method for local excitation 
was analyzed: In that version, we do not manipulate the edgeweights in the window, 
but (besides node and country balancing) we allow a third random change with some 
small probability p: 


(c) ExciTaTion. Add arandom number X € {—1, 1} to the weight of a random 
edge e € E. 


The main result is that there is a constant c > 0 such that if p < 1/(n+f)°, 
then observing the modified balancing process for at least (n+ f)°/p steps, we can 
determine the genus g with high probability. 


Exercise 7.1. (a) Prove that if we assign a complex number to every integer point 
on the real and imaginary axes, then there is a unique analytic function on the 
Gaussian integers extending these values. (b) Prove that if we assign a complex 
number to every Gaussian integer on the diagonal x = y in the plane, then there 
is a unique strongly analytic function on the Gaussian integers extending these 
values. 


na 


Exercise 7.2. Let f be an analytic function on G. Prove that f(a+iy) = 
(—1)**" f(a +iy) is analytic on G. 


Exercise 7.3. Prove that the restriction of any discrete analytic function to the 
(tilted) grid Go is a (complex valued) harmonic function on this grid graph. Also 
prove that every harmonic function on the grid graph Go arises as the restriction 
of a discrete analytic function. 


Exercise 7.4. (a) Prove that if f is an analytic function on the grid, then his f 
is an analytic function of u. (b) If f is a polynomial in Re(z) and Im/(z), then 
Jo f is a polynomial in Re(u) and Im(u). 
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Exercise 7.5. For u € G, let u® = 1, and for n > 1, define 


u 
nm: 1: 
u =n fz dz. 
0 


Define a discrete polynomial of degree n as any linear combination of the discrete 
analytic functions 1, z, z'2',...,z'™. Prove that the space of polynomials of a given 


degree is invariant under shifting the origin |Mercat_2002}. 


Exercise 7.6. Let f be an analytic function on the grid, and let a be a nonzero 
Gaussian integer. Prove that the following functions are discrete analytic: 
(a) Oa f(z) = (f(z+a)—f(z))/a. 

f(z) if z € Go, 


ie {ie if 2 € Gx. 


Exercise 7.7. Define the discrete exponential function as a discrete analytic func- 
tion Exp(x) on G satisfying 


[ f (z)dExp(z) = [ f(2)Exp(z)dz. 


for every discrete analytic functions f and every u,v € G. 
(a) Prove that there is a unique exponential function. 
(b) Prove that 


eo ym 
Exp(z) = > “nl? 
n=0 . 


for values of z for which the series on the right hand side is absolute convergent. 
(c) Prove that exponentials form a basis for all discrete analytic functions 


Mercat 2004). 


Exercise 7.8. Let ¢, \, 0, let Q be a simply connected bounded open set in 

the plane, and let gn : QN(EnG) — Z be a function that satisfies the discrete 

analycity condition on every square contained in 2. We extend the definition of gn 

to every point in 2, as the value at the nearest point of NN (e,G). Let g: QZ, 

and assume that gn — g uniformly on every compact subset of 2. Prove that g is 

analytic (in the classical sense). 

Exercise 7.9. Prove that every analytic function on a lattice domain can be 

written as f1+7f2, where f; and f2 are strongly analytic functions. 

Exercise 7.10. Let g be a strongly analytic function on a lattice domain graph 

G (with the cyclic orientation). For an internal edge uv, define g(uv) = g(u)+ 

(u—v)g(u). 

(a) Prove that g(uv) = g(v) + (@—)g(v). 

(b) Prove that g(uv) is the orthogonal projection of g(u) onto the line parallel to 
Uv. 

(c) Prove that the function |g| satisfies the flow condition at every internal node 

of G*. 

(d) Show that one can define a function H : V(G*)\oo > R such that if uv € E, 

p = |s(uv) and q = rs(wv), then H(q) — H(p) = |g(uv)|?. 

(e) Prove that H is superharmonic on the lattice graph of Go. 

Exercise 7.11. Let G be a finite connected planar map. Let us assign values 

fo(u) € C to the nodes of the unbounded country poo. (a) Prove that there is a 

pair of functions f: V —>Candg: V*\po —C such that f extends fo and 

(7.7) is satisfied at every edge except for the edges of poo. (b) Prove that the pair 

(f,g) is uniquely determined up to adding a constant to g. 
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Exercise 7.12. Let G be a connected planar map with a reference orientation 
and let v € V such that v is not a cutpoint. Let us assign values go(wv) € C to 
edges exiting v so that )7,<y(,) go(uv) = 0. Prove that there is a unique extension 
g: EC of go that is a circulation on G and rotation-free on G\v. 


Exercise 7.13. Prove that every circulation on a map on any orientable surface 
is homologous to a unique rotation-free circulation. 


Exercise 7.14. Let G be a map on an orientable surface and e € E. Prove 
that if there is a any rotation-free circulation that does not vanish on e, then V1. 
is such a circulation. 

Exercise 7.15. Show by an example that there is a 3-connected map G on the 
torus and a nonzero vector f € R” supported on two edges such that Uf = 0. 


Exercise 7.16. Let G be a map on an orientable surface S of genus g > 0, such 
that every noncontractible Jordan curve on the surface intersects the map in at 
least 4g points. Prove that if a rotation-free circulation vanishes on all edges of a 
non-0-homologous cycle and on all edges incident with it, then it is identically 0. 


Exercise 7.17. Rotation-free circulations can be defined on non-orientable closed 
surfaces as well. (a) Prove that there is no nonzero rotation-free circulation on 
any map on the projective plane. (b) Prove that on a map on a non-orientable 
surface with Euler characteristic x, the dimension of the space of rotation-free 
circulations is 1—y. (c) Show by an example that Theorem does not remain 
valid on nonorientable surfaces. 

Exercise 7.18. Let G be a map on an orientable surface. 


(a) Prove that starting with any vector on the edges and doing country and node 
balancing repeatedly so that each node and each country gets balanced infinitely 
often, the edgeweights converge to a rotation-free circulation. 
(b) For u € VUV* and a € R®, let ©, denote the result of node/country balancing 
at u. Let wi,u2,... be the sequence of independent uniform random elements 
of VUV*. Starting with any vector « € R”, define a sequence of vectors by 
recurrence as follows: xo = x and xv; = ®y,2+~-1 (t > 1). Prove that 
2 1 t 2 

E(|or— Wal?) < (1 maa (2/2. 
Exercise 7.19. Prove that if G is a map on an orientable surface with genus 
g > 0, then (W1.). >n-"f-F for every edge e. 


CHAPTER 8 


Discrete Analytic Functions: Statistical Physics 


There are several models in geometric probability and statistical physics which 
are defined on the part of a lattice in a domain 2 in the plane, and which show 
a fully unexpected invariance under conformal transformations of Q, at least in 
the scaling limit, when the edge-length of the lattice tends to zero. This fact (in 
the cases when proved at all) is not only unexpected, but highly nontrivial. It is 
perhaps not surprising that discrete analytic functions play a substantial role in 
several of these proofs. 

We describe four such models, and show the discrete analytic functions (or 
similar functions) that arise from them; then we sketch the proof in one of the 
cases. 


8.1. Conformal invariance 


8.1.1. Domino tilings. A domino is the union of two adjacent squares. A 
lattice domain D may or may not be the union of disjoint dominos; the ques- 
tion is equivalent to the existence of a perfect matching in the tangency graph 
Gi. A necessary and sufficient condition for this can be obtained by special- 
izing the Marriage Theorem, but more geometric conditions can be formulated 


Fournier 1995)|Kenyon 2009}. 


FIGURE 8.1. Left: a domino tiling of a lattice domain and the 
corresponding perfect matching in the dual graph. Right: the 
height function of this domino tiling, in grayscale. Note that the 
edges of the matching correspond to rifts (changes by more than 
1) in the height function. 


A construction capturing important features of domino tilings was introduced 
in [Thurston 1989]. Given a covering by disjoint dominos, we assign a value of 
1 to each edge on the boundary of a domino and —3 to the edge separating two 
squares on the same domino. It is trivial to see that this assignment is rotation-free 
with respect to the cyclic orientation, and so (by the potential argument) it can 
be represented as dh, for some vector h € RY. The function h is called the height 
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function of the domino tiling. It is uniquely determined up to adding a constant 
function; to be specific, we set the function value 0 on the leftmost tile among the 
lowermost tiles. In terms of Gy, the function is defined on those squares containing 
a node of V, and so we can consider it to be defined on the union of these squares 
(Figure|8.1). 

On the boundary of the lattice domain, the height function is uniquely de- 
termined, independently from the domino tiling. On the other hand, the height 
function determines the domino tiling: a pair of squares forms a domino if and only 
if they share an edge along which the height function changes by 3. 

Perhaps the first mathematical result about conformal invariance of the scaling 
limit of combinatorial quantities defined on lattices was obtained by [Kenyon 2000). 
He proved that the variance of the height function is asymptotically conformally 
invariant. This means the following. Fix a point p in the interior of the domain 
Q. Select a tiling 7 of the corresponding lattice domain, then the dual square 
containing p has a certain height h(p). If we select T at random, uniformly among 
all tilings, then h(p) will be a random variable, that has a certain expectation h(p). 
When the edge length ¢ of the grid tends to zero, then the random variable h(p)— 
h(p) tends to a limit random variable g(p). This random variable is conformally 
invariant: If 6 : Q > Q, is a conformal mapping of 2 to another domain 9), 
and g; denotes the function corresponding to g on 0 then g/¢(p)) has the same 


distribution as g(p). See [Kenyon 2009] for a nice introduction to this rich area. 


8.1.2. Honeycombs, triangles and hex. Let G be a 2-connected simple 
planar graph in which all countries are triangles except for the unbounded country 
Poo. Let a,b,c and d be four edges of the boundary of p. (labeled in their coun- 
terclockwise order), splitting it into four node-disjoint arcs ab, bc, cd and da . Let 
us 2-color the nodes with black and white, where we require that nodes in abUcd 
are black and nodes in bcUda are white. Let us say briefly that the boundary is 
properly colored. 

By Proposition and the construction in Figure b), there is either an 
all-black path connecting ab and cd, or and all-white path connecting bc and da, 
but not both. What is the probability that for a random coloring (subject only to 
the requirement that the boundary is properly colored), the first possibility, namely 
ab-cd connectivity occurs? 

This number is quite difficult to compute in general, but there is a beautiful 
asymptotic formula in the case when G is a piece of the triangular grid. More 
exactly, consider the regular triangular grid in the plane, and a finite set of lattice 
triangles whose union is a simply connected region 2. Let G be the graph consisting 
of the edges and vertices of these triangles. Splitting the boundary into four parts 
as above, we can ask for the probability that in a random coloring that is proper on 
the boundary, ab and cd are connected by a black path. We call this the crossing 
probability (from ab to cd). 

Next, consider a sequence G1,Gp2,... of such graphs, with the edge-lengths of 
the grids tending to 0. Suppose, furthermore, that the corresponding domains 0, 
tend to a simply connected domain 2 in the Carathéodory sense. Let the boundary 
polygon of 2, be split into four arcs as before by edges Gy, bn,cCn and dy, and 
assume ay, > a € OO, bn 3 b € ON etc. 

It can be proved that the crossing probability between the arcs a,b, and Cyndy, 
tends to a value p = p(Q,a,b,c,d) which depends on the domain 2 and on the 
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four points a,b,c,d € OO only. The surprising fact, conjectured in |Cardy 1992), 
based on computational experiments and ingenious intuition, is that this value 
is invariant under conformal transformation. This fact was eventually proved in 


[Smirnov 2001]: 
Theorem 8.1. Let 2 be a simply connected domain in the plane, let a,b, c,d, € 
OQ, and let f : Q —> C be a bijective conformal map. Then p(Q,a,b,c,d) = 
P(F(Q), f(a), £(o), Fle), F(d)). 

This allows us to compute the crossing probabilities for any domain (assuming 
we can compute that map f sufficiently explicitly), when perhaps ad hoc methods 
can be used. 
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FIGURE 8.2. Three ways of randomly walking down from the top 
of a triangle to the bottom on the streets amidst hexagonal build- 
ings. Top: a Galton board; left: random walk (not a proper draw- 
ing, which would pass too many streets); right: along the line 
separating a random 2-coloring of the hexagons. 


A particularly nice special case was conjectured by and proved by 
[Smirnov 2001]. Consider a triangular domain T in the regular triangular lattice, 
with vertices a, b and c. Take those hexagons in the dual lattice whose center is a 
lattice point in T; but omit the hexagon containing a. Let all the hexagons along 
the edge ab be colored black, all hexagons along the edge ac be colored white. Let 
the other hexagons be colored randomly. Start a polygonal walk between the two 
hexagons on the top, and keep going so that you always have a black hexagon on 
your right and a white hexagon on your left. Eventually, you will exit at a point y 
on the edge bc. Then Cardy’s Formula tells us that the point y is asymptotically 
uniformly distributed along the edge bc, if the edge length of the lattice tends to 0 
(Figure[8.2| right). Another way of saying this is that if x is a point on the edge bc 
of a regular triangle abc, then 


@ 
O 


C—-2@ 
p(T; a,b,c, x) = a 
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The other two figures illustrate other ways of walking down from a to bc: the 
Galton board (when we move always down, choosing a left or right turn randomly), 
and a random walk. The Galton board produces a binomial distribution, which is 
asymptotically Gaussian. 


8.1.3. Paving your porch, or the Fortuin—Kasteleyn model. The fol- 
lowing construction [Chelkak-Smirnov 2012], providing an inter- 
esting class of strongly analytic functions, will be important for its applications in 
physics. Here we start with an elementary mathematical formulation. 

Consider the tiling of a lattice domain (a “porch” ) with square tiles that have 
two circular arcs on their face, with radius half of their side length, centered at op- 
posite corners (Figure[8.3). There are two “doors”, special tiles marked by arrows. 

Such a tile has two possible orientations. We assume that the tiles on the 
boundary of the domain are oriented alternatingly, except for the doors. More 
precisely, we may assume that in the 2-coloring of the vertices, entering the door on 
the left edge of the picture, we see a white vertex on our left. Then the black vertices 
on the upper boundary have partial circles about them (which may consist of one, 
two or three quarter-circles), and so do the white vertices on the lower boundary. 
The arcs that constitute these incomplete circles will be called ornamental, and the 
edges of the tiles incident with ornamental arcs will be called external. No matter 
how the remaining tiles are put down (including two on the boundary, at the doors), 
the non-ornamental arcs on the tiles form a single open curve y connecting the two 
doors, and a number of closed curves (possibly quite long and winding). 
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FIGURE 8.3. Tiling of a porch with tiles of a commercial pattern. 
The 2-coloring of the nodes is shown only along the boundary, the 
2-coloring of the squares, only near the doors. Tiles on the bound- 
ary are placed so that they form a neat sequence of incomplete 
circles, interrupted only at the two doors. No matter how we put 
down the tiles in the middle, we get a single curve connecting the 
two doors, together with a collection of closed curves. 


Suppose that we have a sequence of lattice domains 2, with the size of tiles 
tending to zero, so that the domains “tend to” a simply connected domain 2. 
Let the two doors (shorter and shorter edges) converge to points a and b on the 
boundary of 2. 
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Let us tile the domains 2, randomly, under the boundary conditions as dis- 
cussed above. There is a complication here: not every tiling has the same probabil- 
ity; we weight each tiling T with w(T) = (/2)# of closed curves and choose a tiling 
with probability proportional to its weight. Some explanation for this strange choice 
of weights will be forthcoming when we discuss the connection of tilings with the 
Ising model. We call this probability distribution on tilings the physics distribution 
(as opposed to the uniform distribution). 

Each tiling will define a random door-to-door curve y,. It turns out that the 
distributions of these random curves will tend to a distribution T'(Q, a, 6) on Jordan 
curves inside 2 connecting a and b. This is not trivial, but not too surprising so 
far. The amazing fact is that this limit distribution will be invariant not only under 
translations (trivial), similarities (a matter of normalization), rotations (nontrivial), 
but under all conformal transformations! If we have a conformal map @: Q 31, 
and y is a random curve from the distribution T'(Q,a,b), then the distribution 
of ¢(y) is T'(0', ¢(a), ¢(b)). This result was proved in [Smirnov 2010a]; see also 
for a more informal presentation, and for 
a generalization to planar graphs with a rhombic embedding (a.k.a. isoradial graphs 
or critical graphs), but also for more details and applications. 

To illustrate why this result is surprising, let us consider a very special case: 
Q is the unit disk D, and a and 6 are diametrically opposite points. The situation 
looks quite different when the segment ab is horizontal (so parallel to the sides of 
the tiles), and when it is rotated by some angle. Nevertheless, in the limit the 
orientation of the lattice plays no role in the limit. 

It is not easy to make the above discussion precise. The convergence 0, 4 Q 
can be defined in the Carathéodory sense: there are conformal maps ¢, : D — 
Q, and d: D —+ Q so that ¢, > ¢ inside D. The limit distribution of a- 
b curves is much more difficult to define and describe; this needs the theory of 
stochastic Lowner evolution (SLE) developed by [Schramm 2000), [Schramm 2007], 
whose presentation is outside the possibilities of this book. 

Let us formulate one consequence of the method that can be formulated, and its 
proof can be sketched, without invoking the full machinery of SLE. Let us consider 
a porch with four doors a, b,c, d (in this counterclockwise order). These mean four 
special edges on the boundary. Assume that the number of edges of the boundary 
between consecutive doors is even, and that the tiles incident with these edges are 
fixed so that the vertices at an odd distance from the nearest door are centers of 
partial circles (Figure[8.4). 

If we lay down the remaining tiles arbitrarily, a door-to-door curve 71 will start 
at a. It is easy to argue (using parity) that +; must exit the porch either at b or 
d. Another curve 72, disjoint from 71, will connect the remaining two doors. This 
tells us that there are two possibilities, and we may wonder what is the probability 
that the curve starting at a exits at bD. 


Theorem 8.2. Consider a sequence Q, of lattice domains, tending to a simply 
connected domain Q. Let an, bn,Cn,dn be four doors on Qn, and suppose that they 
tend to four different points a,b,c,d on the boundary of Q. Then the probability 
that for a random tiling a door-to-door curve connects a, and by, tends to a value 
p(Q; a,b, c,d), and this number is conformally invariant: if @: Q > Q! is a 
conformal map, then p(Q; a, b, c,d) = p(X; d(a), o(b), d(c), o(d)). 
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FicureE 8.4. A tiling with four doors. 


The proof of this theorem (even in a somewhat sketchy form) will take a lot 
of preparations, to be done in the Section [8.2.1] But first (to justify the interest 
in this seemingly quite special question), we discuss its relation to an important 
problem in statistical physics. 


8.1.4. Ising model in the plane. In this most classical model in statistical 
physics, we consider a lattice domain and its lattice graph G, and assume that 
there is an atom sitting at each node. Assigning to every node of G (every “site”) a 
“state”, which can be UP or DOWN, we get a “configuration”. The configuration 
can be described by the set S of atoms in the UP state (Figure (8.5). 
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FIGURE 8.5. Three kinds of states in an Ising model: a disordered 
state, a state close to the homogeneous ground state in the fer- 
romagnetic case, and a state close to the chessboard-like ground 
state in the antiferromagnetic case. 


Two atoms that are adjacent in the grid have an “interaction energy”, which 
depends on their states. In the simplest version of the basic Ising model, the 
interaction energy depends only on whether or not the atoms are in the same 
state. Adding the same constant a to the interaction energies shifts the total 
energy by a|E| independently of the configuration, and such a shift is irrelevant; 
so we may assume that the interaction energy between atoms in the same state 
is 0. Multiplying the other interaction energy by a positive number means only 
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changing the unit of temperature, so we may assume that the interaction energy 
between atoms in different states is either J = 1 or J = —1. 
The total energy of a given configuration S is 


H(S) = J\E(S,V\S)I. 


Basic physics (going back to Boltzmann) tells us that the system is more likely to 
be in states with low energy. In formula, the probability of a given configuration 
is proportional to e~#(5)/T, where T is the temperature (from our point of view, 
just a parameter). Since probabilities must add up to 1, these values must be 


normalized: 
e A(S)/T 


P(S) = —., 


where the normalizing factor Z is called the partition function of the system. The 
partition function is simple to describe: 


Z= > ener 
SCV 
but the number of terms is enormous, and so partition functions can be very hard 
to compute or analyze. 

The behavior of the system depends very much on the sign of J. If J = 1, then 
adjacent pairs that are in the same state contribute less to the total energy than 
those that are in different states, and so the configuration with the lowest energy 
is attained when all atoms are in the same state. The typical configuration of the 
system will be close to this, at least as long as the temperature T is small. This 
is called the ferromagnetic Ising model, because it explains how materials like iron 
get magnetized. This will be the case we consider in this section. If J = —1 (the 
antiferromagnetic case), then the behavior is different: the chessboard-like pattern 
minimizes the energy, and no magnetization occurs at any temperature. 

One may notice that the temperature T emphasizes the difference between the 
energy of different configurations when T —> 0 (and de-emphasizes it when T — oo). 
In the limit when T’ > 0, all the probability will be concentrated on the states with 
minimum energy, which are called ground states. In the simplest ferromagnetic Ising 
model, there are two ground states: either all atoms are in state UP, or all of them 
are in state DOWN. As the temperature increases, disordered states become more 
likely. The transition from the ordered state to the disordered may be gradual (in 
dimension 1), or it may happen suddenly at a given temperature (in dimensions 2 
and higher, for large graphs G). This temperature is called the critical temperature, 
and the phenomenon itself, a phase transition. 


8.1.5. Porch tiling and the Ising model. Our next goal is to show the 
connection between the Ising and the porch tiling models. We start with rephrasing 
the porch tiling model, at least at the critical temperature. Let us draw, for each 
tile, the diagonal that does not intersect the quarter-circles. The new edges form a 
subgraph of the union of the two lattice graphs Gp and G; on GgNV and G,{NV. 
If we specify the subgraph Hp on GoM V, then the other subgraph Hj is determined 
as the graph of those edges of G; that do not cross the edges of Hp. So a porch 
tiling can be specified by a subgraph Ho of Go (we keep all nodes). 

Note that each connected component of Ho, as well as each connected compo- 
nent of H,, is surrounded by a closed curve of the tiling, except for the connected 
components containing the two boundary arcs. So the number of closed curves is 
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FIGURE 8.6. A porch tiling, and subgraphs of Go and G, sur- 
rounded by closed curves. 


c(Ho)+c(H1)—2 (recall that c(H) denotes the number of connected components 
of the graph Hf). 

The component numbers c( Ho) and c(H) are related. Consider the graph Go 
formed by the black nodes and the diagonals between them. Add a new black node b 
outside that lattice domain D, and connect it to every black node on the boundary. 
This way we get a planar graph en The dual of this graph is the graph Gi obtained 
from G, by adding edges crossing the edges incident with b. From the graph Ho 
(as defined above) we construct a subgraph Jo of Go by adding to it the new node 
b, all edges connecting b to black nodes on the upper arc between the doors, and 
(say) to the black endnode of the left door. The corresponding subgraph of G, will 
be obtained from Hy by adding all edges of the path connecting consecutive white 
nodes on the lower arc, and ending at the white endnode of the right door (Figure 


(8.6). It follows that 
(8.1) e(Ji) = c(Jo) + |E(Jo)| — |V (Jo)| +1 = e(Jo) + |E(Jo)| —|V(Go)| 
(recall Exercise |2.3). It is also clear that c(Jo) = c(Ho) and c(Ji) = c(H1), since 
the additional part simply connects the ornamental centers to the main part. 

Let &9 be the number of black nodes on the lower arc, then |E(Jo)| = |E(Ho)|+ 
fy. Using (8-1), we get that c(H,) = c(Ho)+|E(Ho)|+40—|V(Go)|. Thus the 
weight we assign to a tiling (to define the physical distribution) is 


w(T) = (/2)# of closed curves __ (/2)°(Ho)+e(Ha)—2 ~~ g\|E(Ho)|/2+e( Ho). 


The fact that Ising model is closely related to the porch tiling model is not 
obvious, since the Ising model concerns subsets of the nodes, while the tiling model 
(after the reformulation above) concerns subsets of the edges. To connect the two 
models, let G = (V, £) be an arbitrary graph (we do not even need planarity for this 
argument), and let us consider pairs (S,Y), where S C V and Y C E[S]UE[V\ S}. 
Fix a parameter 0 < p < 1, and consider a probability distribution on such pairs, 
where 

P(S,¥) ~ pl*\(1—p)\¥l wv = 
l—p 
(we have to normalize the given values to get a probability distribution). This value 
is the same for all sets S that are unions of connected components of the graph 
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(V,Y); the number of such sets is 2°"). If we generate a pair (S,Y) from this 
distribution and then forget S, then we get a probability distribution on subsets of 
F for which a 
P(Y) ~ (+) gelV.¥), 
Wap 
The value p = 2—/2 plays a special role: this corresponds to the critical tempera- 
ture in the Ising model. In this case, p/(1—p) = V2, and so 


P(Y) rd gl¥1/2+e(V,Y) 


This is the same probability distribution as coming from the porch tiling model. 
On the other hand, let us forget about Y. Fixing S and setting E’ = E[S]U 
E|V \ S|], we get 


RiSje Sow aape™ 


YCE’ 
= (1—p)lZ(SV\5)1 S- pl¥|(1—p)/2'\¥| a (1—p)ZGV\S)1, 
YCE’ 
Defining the temperature T so that 1—p = e7!/7, we get 


P(S) ~ eTBVASN/T 
This is just the probability distribution in the Ising model. 


8.2. Analytic functions and porch tilings 


The remainder of this chapter is devoted to sketching the proof of the conformal 
invariance of the porch tiling model (a.k.a. the Fortuin—Kasteleyn random cluster 
model). 


8.2.1. A strongly analytic function. We consider a lattice domain D, with 
the acyclic orientation: every edge of Gp is oriented from its black endpoint to its 
white endpoint. We orient the graph G, (the tangency graph of the squares) in 
the cyclic sense (the boundary of every black country is a counterclockwise oriented 
cycle). 

Let us start with a simple observation. Tracing the curve y from the left door 
to the right door, we cross several edges. A different tiling will create a different 
curve, crossing different edges. However, if an edge is crossed by both curves, it is 
crossed in the same direction. Indeed, following the curve, we see that every edge 
is crossed so that its black endpoint is on the right (so every edge is crossed in the 
same direction as the oriented edges of G7, crossing it). 

It is also easy to observe that if we consider a random tiling (say, from the 
physics distribution, but here we could talk about any distribution), and for an 
edge e we denote by p(e) the probability that the door-to-door curve crosses e, 
then for the four edges e1, €2, 3, e4 of a tile (in this cyclic order), we have 


p(e1) +p(es) = p(e2) + plea) 


(since every curve loads {e1,e3} by the same amount as it loads {e2,e4}). Our 
main tool below will be a similar relation for the physics distribution even if we 
assign a “phase” (a unit complex vector) to each crossing. 

Let us remove those boundary tiles that contain two ornamental arcs; they play 
no role. Those tiles that contain one ornamental arc have a fixed orientation, but 
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their non-ornamental arc can be part of the curve y. Let us call them boundary 
tiles. 

We are going to construct a strongly analytic function on the graph G7, of this 
grid from these tilings. Recall that we define a weight of a tiling as 


w(T) = (V2)# of closed cycles 


The total weight, in physical terms the partition function, is Z = }),w(7). The 
ratio w(T)/Z gives the probability of the tiling in the physics distribution. 

Next, we construct a function on the edges of G. Set 6 = e™*/4 = (1+i)/V/2 for 
convenience. Given a tiling T (we keep the tiles on the boundary fixed, except for 
the two tiles in front of the doors), we trace out the curve y connecting the doors, 
starting (say) at the left door. Along the curve, we count turns with sign: when 
crossing a tile, we make either a left turn, which counts 1, or a right turn, which 
counts —1. When an edge e is crossed, this tiling 7 contributes 


(8.2) frie) = _ Q7# of turns up to e 
to the function value of e; edges not crossed by y receive no contribution from this 
tiling. 

As pointed out above, the curve 7 crosses e in the same direction for every 
tiling T. It follows that f7(e) is parallel to the same line for all 7; f7(e) is real if e 
is a vertical edge oriented down (in the acyclic orientation), f7(e) is a real multiple 
of @ if e is a horizontal edge oriented right to left etc. We can formalize this as 
follows: f7(uv) is parallel to \/i(u—®) (when we talk about an edge uv, we assume 
that it is oriented from u to v; so in this case u is black and v is white). 


Nucet cee 


FIGURE 8.7. Directions in the complex plane of the contributions 
of door-to-door curves to f. They are determined by the edge up 
to the sign. 


We define, for every edge e = uv, 


(8.3) f(e) = S> fr(e) = E(1(e € ¥)o-# of turns up to °), 
T 


where the expectation is taken for a random tiling from the physical distribution. 
By the observations above, 


(8.4) f(uv) | Vi(u—v) and |f(e)|<Pleey) <1. 
Let us consider a tile S, and let e1, e€2,€3,e4 be its edges in counterclockwise 
order. If S is an internal tile, then we define 


(8.5) (8) = 5(f (es) + Flea) + Fes) +4 (64)). 
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For a boundary tile, we have to modify this definition. We may assume that e1 
and €2 are its internal edges (the other two edges carry no information). We define 


(8.6) g(S) = (2—V2)(f(e1) + f(e2)). 


Note that if a door-to-door curve ¥ intersects this tile at all, then it intersects both 
e, and eg. If the tile is on the upper boundary, then y must enter through e; and 
make a right turn inside. So f(e2) = 0f(e1). For tiles on the lower boundary, the 
curve crosses e; and e2 in the reverse order, but the relation f(e2) = 0f(e1) remains 
valid. 


Lemma 8.3. The function g is strongly analytic on the graph whose nodes are the 
tile centers, and in which two centers are connected by an edge if the tiles share an 
internal edge. 


Proof. We compare the contributions from a tiling J and from the tiling 7’ 
obtained by rotating an internal tile S by 90°. Let y and 7 be the door-to-door 
curves in J and 7’, respectively. If y does not cross S', then neither does y’, and 
all these contributions are 0. If 7 crosses S, then so does 7’; one of them crosses 
once, and the other crosses twice (Figure|8.8). Let (say) 7 cross twice and 7 cross 
once. We may assume that y enters S through e;. There are still two possibilities 
to consider, namely whether y turns left or right after entering S. We discuss the 
case of a left turn; the other case is analogous. 
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FIGURE 8.8. Rotating a non-boundary tile that is crossed by ¥. 


We can express all contributions in terms of a = f7(e1): 
fr(ez)=0a, fr(es)= ia, fr(ea) = Oa, 
and (since there is one more cycle in 7’ than in 7) 
fri(er) = V 2a, frr(e2) =OV2a,  frr(es) = frr(ea) = 0. 
Hence 
fr(er)+fre(er) = (V2+1)a, fr (e2) + fr(e2) = (V2+1)6a, 
fr(es)+fr(es) = ia, fr(ea)+ fr (ea) = Ga. 


Notice that 


fr(e1) + fr (e1) + fr (es) + fr (es) = fr (e2) + fr (e2) + fr (ea) + fr (ea). 


Summing over all tilings, we get that 


(8.7) fer) + flea) = Flea) + Flea) = g(S). 
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Consider an edge e = uv of G between two non-boundary tiles S and S’, so 
that S” is to its left. Let pg and pg denote their centers, and let st and xy be the 
edges of S and S’, respectively, opposite to uv. Then 


9(S) —9(S") = (f (st) + f(ur)) — (Fey) + f(ur)) = f (st) — f(xy). 
Since both f(st) and f(xy) are parallel to /i(fé—s) = /i(y—2) = V/i(u—B) = 
Vs —Bg, so is their difference: g(S')—g(S’) || Y0s—Vg-. Similar computation 
shows that this relation remains valid for edges between a boundary tile and a non- 


boundary tile, as well as for non-ornamental edges between boundary tiles. This 
proves that g is strongly analytic. 


For the special edge at the left door, we have fy(e) = wT) and so f(e) = 1. 
The tile S next to the door satisfies R(g(S)) = 1. 

From the fact that f(e,) lL f(e3) and f(e2) L f(ea), it follows that 
describes a decomposition of g(S) into the sum of two orthogonal vectors in two 
different ways. In other words, f(e;) is the orthogonal projection of g(.S') onto the 
corresponding direction (Figure left shows how the values of f on the edges 
can be expressed if g(S) is given in the standard form). It is easy to check that this 
relation is also valid for the non-ornamental edges of boundary tiles (the definition 
of g was adjusted so that this should hold). In particular, 


(8.8) If(ex)? +1 f(es)I? = IF (ea)? +1F (ea)? = 16S)? 


for the four edges of an internal tile S. 
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FIGURE 8.9. Left: the values of the function f on the edges of a lattice 
square, in terms of the value of g. Right: incisions at the ornamental 
arcs. Only part of the boundary is shown. 


For a boundary tile S, where f; and f2 are its internal edges, we can define 
f(e3) = (V2—1)if(e1) and f(e,) = —(V2—1)if (ez) for the other two edges. Then 
g(S) satisfies (8.5). We have to be careful here on two counts: first, this definition 
of f(e3) and f(e4) does not have the same combinatorial meaning as it does for 
internal edges (which would make these values equal to zero); second, one or even 
both of the edges e3 and e4 may be shared by other boundary tiles, and then the 
values computed from those tiles might be different. It is best to think of making 
an incision between the two boundary tiles along such edges (or scratch out the 
grout, if you wish). A boundary node with two (or three) ornamental arcs around 
it is split into two (or three) separate nodes of degree two (Figure[8.9] right). A tile 
containing two ornamental arcs will be completely cut off, and it can be discarded, 
playing no role in the arguments. This domain with incisions will be denoted by 
D"°, and the corresponding graph by G™ = (Vi"*, E™°), 
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The f-values of any two edges of a lattice square S' determine the value g(S), 
as well as the f-values of the other two edges of S. For two opposite edges, this 
follows by (8.7). It is not hard to calculate that 


(8.9) g(S) = (1-1) f(e1) +(1+4)f (ea), 
and 


(8.10) fles) = —ifler)+ +i) f(e2), flea) = 1 -) Ff (er) +2f (ea). 

For boundary tiles in the graph G'™°, expresses g as a linear combination of 
f(e1) and f(e2) with different coefficients; nevertheless, these formulas remain valid 
for these tiles as well (since f(e,) and f(e2) are not independent, but related by 
f(e2) = Of (er). 

To formulate a further key property of the function g, we describe another way 
of reducing the domain: the “irrelevant fringes” can be removed. To be precise, 
we cut each boundary tile into two triangles by the diagonal separating the arcs on 
the tile, and delete the triangle containing the ornamental arc. In particular, we 
delete all tiles with two ornamental arcs (Figure [8.10). This reduced domain will 
be denoted by D'®*, and the corresponding graph by G'ed = (Vt°¢, Ere¢), Note 
that the graph G'@¢ is no longer a subgraph of the grid, since the diagonals of the 
tiles where we have cut them are new (longer) edges on the boundary now; we call 
these edges diagonal edges. In the 2-coloring of the nodes, let (say) all the nodes 
on the lower arc be black, and all the nodes on the upper arc be white. We also 
need to define the orientation of the edges of the boundary arcs: we orient them 
away from the left door (along the boundary). 


FIGURE 8.10. Cutting off the fringes of a porch tiling. 


Lemma 8.4. Let uw be a diagonal boundary edge, obtained from the boundary tile 


S. Then g(S) || /w—ut. 


Proof. Let ww be an edge of (say) the upper boundary arc of the porch between 
the doors, where this boundary arc passes the edge from u to w. Let v be the third 
vertex of the triangle containing ww. Then, as we have seen, f(uw) = 0f(vu), and 
so 


9(S) || f(vu) + f(vw) = (1+8) f(vu) |] (1+0)Vie-a) 
= JEG —w || \/4@-w) || Vo=w. 
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8.2.2. Guessing the limit. Lemma|8.4|suggests what the limit of the discrete 
analytic functions g should be when we are looking at tiling the same porch D with 
smaller and smaller tiles. Let us think of the difference w—wu as the “discrete 
tangent vector” of the domain D*@¢ (following the boundary from the left door to 
the other one). Then we have a discrete analytic function on D'*¢, whose value is 
parallel to the square root of the conjugate of the tangent vector on the boundary. 
What would the continuous analogue of such a property be? 

Let 2 be a simply connected domain in the plane with a smooth (or piecewise 
smooth) boundary, and let a and b be two points on its boundary. Let ®: Q > CU 
{co} be a conformal map that maps 2 onto the strip 0 < 3(z) < 1, so that ®(a) = 
—oo and ®(b) = +o0. The mapping ® is determined only up to an additive real 
constant, but this will not matter, since we need its derivative ®’ only. Furthermore, 
since the map is one-to-one, its derivative is not zero in the interior of 2, and hence 
V®’ can be defined choosing the branch of the square root with %(./®’(c)) > 0 for 
a point c on the counterclockwise lower arc. 

Let » = (4|,)~1, then ®(¥(s)) = 8, and differentiating, we get 
®’ (u(s))v’(s) = 1. This means that 

1 


/ a 1 wits) 
WO) = Fe = wor’ 


where w’(s) is a tangent vector to the domain 2 at the boundary point ~(s). Thus 


’ = 1 alt(e): 
® (w(s)) |2’(s)| w (3); 
the square root on the right is well defined once we fix its sign at (say) s = 0 
appropriately. This suggests that if Q is “close” to D, then (®’)!/? could be “close” 
to g. 

To make this more precise, let G, (n = 1,2,...) be the graph of a simply 
connected lattice domain D,, of the lattice G,, = ¢,G, where ¢,, — 0. Assume that 
Dy 2 Q, where 2 is a simply connected bounded domain in the plane, and the 
convergence is in the Carathéodory sense. 

Let fn and gn denote the functions defined by and (8.5) for the lattice 
graph G,, of D,. We can consider g,(z) to be defined on all points z € D, as the 
value of g, on the square containing z (ties broken arbitrarily). The main theorem 


of [Smirnov 2010b| and |Chelkak—Smirnov 2011) is the following. 


Theorem 8.5. With the notation above, 


1 
gn > VO 
Te 


uniformly on every compact subset in the interior of Q. 


The considerations above imply that if the functions gn, /./én tend to any func- 
tion, then this must be analytic and in fact equal to /®’. But to show that the 
sequence of functions gn /\/En is convergent is highly nontrivial, and it takes careful 
estimates based on analysis and on the theory of random walks. 


8.2.3. An almost harmonic function. The proof of Theorem [8.5] depends 
on the introduction of an auxiliary function on the nodes, that is “almost” har- 
monic, and which is interesting on its own right. Equation means that the 
function |f(e)|? is rotation-free (on the edges of the domain D™, with respect to 
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the acyclic orientation). Hence it has a potential, which means that there is a 
function H: Vi" — R such that 


(8.11) H(v) —H(u) = |f(wr)/? 


for every (oriented) edge uv € E™. 

The difference of H along a diagonal of a square can be expressed in terms of 
the function g. Let v and z be diametrically opposite vertices of a lattice square S, 
then 


(8.12) H(v) H(z) = 5%e((v z)g(S)7). 


This follows by direct computation using and (8.10). It is easy to derive from 
this an expression for the analytic defect of H: 


(8.13) H(v1) +H (v2) +H (vs) +8H (04) = S*9(5)? 


for every lattice square (v1 v2v304) in D. 
Lemma 8.6. The function H is constant along both boundary arcs of D*°*. 


Proof. Consider a boundary triangle uvw, with vertex v at the right angle. 
Let, say, v be white. Then by the argument in the proof of Lemma [8.4] we have 
f(wv) = Of(uv) or f(wv) = Of (uv). In both cases, 


H(w)—H(u) = H(w) —H(v)+H(v)—H(u) = |f(vw)|?—|f(vu)|? = 0. 


Let us add that the values of H on the upper boundary arc of D'®* are one 
larger than its values on the lower arc of this domain, since f(uv) = 1 for the edge 
uv at the left door. Since H is determined only up to an additive constant, we may 
assume that 


(8.14) He f on the upper arc of D'°¢, 


0 on the lower arc of D'¢. 


It follows from the definition of H that if wv is an edge of G (oriented from wu to 
v), then H(u) < H(v). 

This function H is not harmonic in general, but it has the following useful 
weaker property: 


Lemma 8.7. Let v be a white internal lattice point, and let v,,..., v4 be the lattice 
points diagonally opposite to v in the squares containing it. Then 


1 
H(v) < 7 (Alvi) +H (v2) +H (vs) +H (v4), 
i.e., H is subharmonic on the white grid graph in the interior of D. Analogously, 
HA is superharmonic on the black grid graph in the interior of D. 
Proof. Let w1,...,u4 be (black) neighbors of v, and set fy, = f(uxv) (Figure 
8.11). It suffices to prove the identity 
(8.15) A (v1) +H (v2) + H (v3) + (v4) —4H(v) = 2\ fi -ife— fs +ifal’. 
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FIGURE 8.11. 


The proof involves some computation. By (8.10), f(ueve) = —ifk4i+(1+¢) fe, 
and hence 


H (vg) —H(v) = H(vp)— H(ug) +H (ux) — H(v) = |f(uev)|? -|f(unve) |? 
= |frzil? + fel? -—A+OF efegi— a fe P eq 


Note that fi+1 || (1-7) fx, and this implies that (1-1) f;,frz41t+ (+i fefeys = 9, 
and so we can write 


(8.16) H (vp) — H(w) = | fetal? +| fl? — 20 Fa feti — fefras)- 


On the other hand, expanding 2|f,—if2—f3+if4|? we get terms 2|f,|?, terms 
2i(f.fet+i1—Skfr4i), and terms containing f;, frs2+fefps2, which is zero since 
fr+2 || if. So we get the same expression as obtained by adding up the right hand 


sides of (8.16). This proves the identity (8.15). 


A useful consequence of this lemma is that for every node u, 


(8.17) 0< H(u) <1. 


Indeed, to see e.g. the case when wu is black, we notice that it holds true if u is 
a node on the lower are of D'*¢ (when H(u) = 0), and also if u is a neighbor of 
a white node v of the upper are of D'*4 (because then H(u) < H(v) = 1 and 
H(v)—H(u) = |f(uv)|? <1). So H(u) > 0 on all black nodes by its superharmonic 
property. Similarly, H(v) < 1 for all white nodes. Finally, if u is a black node, v is 
a white neighbor of u, then 0 < H(u) < H(v) < 1. 

Consider the restriction of H to the black nodes on the boundary of D'™°, and 
extend it to a harmonic function hg on the black grid inside D™°. We define hw 
analogously. It follows from Lemma that hg(u) < H(u) for every black node 
u, and hw(v) > H(v) for every white node v. 

Let us add that for a black node wu, and any (white) neighbor v of u, H(u) < 
H(v) by definition, so (informally) we may have in mind the inequalities 


(8.18) hp < Alp“ <”H|w < hw 


(we have to put quotes around the middle inequality sign, since H|p and H|w are 
defined on different sets). The functions hg and hw are harmonic extensions of 
almost the same values on almost the same sets. So if we can show that hg + hw 
on the boundary, then we will know that H = hg, so H is approximately harmonic. 
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8.2.4. Sketch of the proof of Theorem Let H,, denote the function 
defined by and for the graph G,,, and define hg, and hw,,, analogously. 

First, it suffices to prove that there is a subsequence of the sequence (G, : n = 
1,2,...) for which the convergence holds as claimed in the theorem. This follows 
by a standard “subsequential limit” argument. 

Second, there is a subsequence for which g,/\/€n converges, uniformly in the 
interior of 2, to some function g: 0 — Z, and also H,, hg, and hy,.,, converge to 
appropriate functions H wh aoe : QR. (This step takes some careful estimates 
of harmonic and sub/superhamonic functions on the grid, for which we refer to the 
papers of Chelkak and Smirnov.) From the fact that the functions g, are discrete 
analytic, it follows that g is analytic (in the classical sense). Using and 
(8.18) (in an exact quantitative form), one can prove that H= fen a Bus and it is 
a harmonic function (in the analysis sense). Furthermore, the boundary values of 
H are 0 on the lower a-b arc of OQ and 1 on the upper a-b arc. 

Third, notice that the function 3(®) (where ® is defined in Section |8.2.2) has 
the same properties: it is harmonic, 0 on the lower arc, and 1 on the upper arc. It 
follows that H = 3(®) and so 

Hy, > 3(®) 
in the interior of 2. 

Fourth, (8.12) implies that for any node u and any path P = (uo,...,uzx = u) 
in Gp connecting u to a (black) node ug on the lower arc, we have 
(8.19) 


i Uj —Uj—1 2 1 : In (Sj) \? 
Ay,(u) = 3 (ge m8 ) = pee) ), 


where S; is the lattice square with diagonal u;_;u;. The sum on the right is an 
approximation of the integral of g? /e,, along P (taking into account that the lattice 
EnGo has edge length V2€n). it follows that 


H,(u) > sa(f g(z)? dz), 


where v is any point on the lower arc. So it follows that 


n(5 [9 dz—i®(u)) =0. 


Since the function in the large parenthesis is analytic, we can conclude that not 
only its real part is zero, but so is the function itself: 


sf oe” dz = i®(u). 


Differentiating and taking square roots, we get the theorem. 


8.2.5. Sketch of the proof of Theorem [8.2} Let us add a curve € on the 
outside connecting doors a and 6 (Figure |8.12). For every tiling, there will be a 
unique curve from door c to door d, which may or may not go through €. In fact, 
it goes through € if and only if the door-to-door curves of the tiling (without €) 
match a with c and b with d. The arguments above can be extended to the porch 
with the additional curve to show that the probability that the door-to-door curve 
goes through € is conform invariant. 
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Exercise 8.1. A lattice domain with the same number of black and white squares 
can be domino tiled unless there are two boundary vertices x, y whose distance in 


the reduced dual graph is less than their height difference. 


Exercise 8.2. Let v and z be diametrically opposite vertices of a lattice square 


S. Prove that H(v) — H(z) = $8((v—z)9(S)’). 


Exercise 8.3. Let e1,...,e4 be the edges incident with a lattice point in the 
interior of D, and let S1,...,54 be the squares incident with it. Prove that 


I9(Si)I? +--+] 9(Sa)l? S 2 F(er)? +--+ 1F(ea))?)- 


CHAPTER 9 


Adjacency Matrix and its Square 


In the previous chapters, the vector-labelings we considered were mostly in 
low dimensions, and planar graphs played the major role. Now we turn to repre- 
sentations in higher dimensions, where planarity plays no role, or at most a little 
role. 

Perhaps the easiest nontrivial way of assigning a vector to each node is to use 
the corresponding column of the adjacency matrix. Even this easy construction has 
some applications, and we will discuss one of them. 

However, somewhat surprisingly, it turns out that a more interesting vector- 
labeling can be obtained by using the columns of the square of the adjacency matrix. 
This construction is related to the Regularity Lemma of Szemerédi, one of the most 
important tools of graph theory, and it leads to reasonably efficient algorithms to 
construct (weak) regularity partitions. 


9.1. Neighborhoods, rank and size 


If we know the rank of the adjacency matrix of a graph, how large can the 
graph be? Obviously we have to make some assumption, since we can replace any 
node by an arbitrary number of twins without increasing the rank of the adjacency 
matrix. However, if we exclude twins, then the question becomes meaningful. 

An almost trivial answer is given by the following argument. Let r = rk(Ac) 
be the rank of the adjacency matrix of the simple graph G (we will call this number 
simply the the rank of G), and let (say) the first r columns of the adjacency matrix 
form a basis of the column space. Two rows that agree in their first r positions will 
agree everywhere, which is impossible if there are no twins. So the initial r-tuples 
of rows will be all different, and hence there are at most 2" rows. 

This argument works over any field, and no better bound can be given over 
GF (2), for example (see Exercise (9.2). However, over the real field we can use 
geometry, not just linear algebra, and prove a substantially better (almost optimal) 


bound |{Kotlov—Lovasz 1996]. 
Theorem 9.1. Let G be a twin-free graph on n nodes of rank r. Then n = O(2"/2). 


Before turning to the proof, we recall a result from discrete geometry. The 
kissing number s(d) is the largest integer N such that there are N nonoverlapping 
unit balls touching a given unit ball in R?. For d = 2, it is easy to see that 
this number is 6, but the 3-dimensional case is already difficult: whether the 3- 
dimensional kissing number is 12 or 13 was the subject of a famous debate between 
Isaac Newton and David Gregory in the 17-th century (the fact that s(3) = 12 was 
not proved in a watertight way until [Schiitte-v.d.Waerden 1953]). 

The exact value of s(d) is only known for a few other small di- 
mensions, but the following bound, which follows from tighter estimates 
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Kabatjanskii—Levenstein 1978], will be enough for us: 
(9.1) s(d) = O(2%/?), 


In other words, every set of more than s(d) vectors in R¢ contains two vectors such 
that angle between them is less than 7/3. This theorem has an almost immediate 
consequence for graphs with low rank adjacency matrices: 


Lemma 9.2. If a graph G of rank r has n > s(r+1) nodes, than it has two nodes 
i and j such that |N(i)AN(j)| < n/4. 


Proof. Let a; denote the column of Ag corresponding to 7 € V, and let u; = 
1—2a;. Clearly the vectors u; are +1-vectors, which belong to an (at most) (r+1)- 
dimensional subspace. Applying the kissing number bound to the vectors u;, we 
get that there are two vectors u; and u; forming an angle less than 7/3. For two +1 
vectors, this means that u; and u, differ in fewer than n/4 positions. The vectors 
a; and a, also differ in these positions only. 


We also need a couple of simple facts about twins. Recall that they cannot be 
adjacent. If we delete a node from a graph G that has a twin, then the rank of 
the graph is not changed. Furthermore, no new twins are created: if a pair 7,7 of 
remaining nodes is distinguished by some node u of G having a twin v, and if u is 
deleted, then v still distinguishes i and j. 


Lemma 9.3. Let G be a graph rank r. (a) If i and j (nonadjacent) nodes of G 
that are twins in an induced subgraph H, but not in G, then rk(H) <r—2. (b) Ifi 
and j are adjacent nodes in G and no node of a X CV \ {i,j} distinguishes i and 
j, then rk(G[X]) < r-1. 


Proof. (a) The matrix Ag must contain a column that distinguishes rows 
i and j, and so it is not in the linear span of the columns in X. Adding this 
column to Ay increases its rank. Adding the corresponding row increases the rank 
further. (b) The columns corresponding to X have the same entry in rows 7 and Jj, 
but column 7 has different entries in these positions, and so these columns cannot 
generate column 7. So the rank of these columns is at most r—1, which implies 
that rk(G[X]) <r-1. 


Proof of Theorem |9.1} To facilitate the proof by induction on r, we set 
f(r) = C2"/? — 16, where the constant C > 16 is chosen so that s(r+1) < f(r) for 
allr. The statement we prove is that n < f(r) for every twin-free graph on n nodes 
and of rank r. For r = 1,2,3 this is easily checked. So we may suppose that r > 3. 

If |N(@i)AN(j)| > n/4 for all pairs of nodes, then Lemma implies that 
n < s(r+1) < f(r). So we may assume that there are two nodes i and j such that 
|N(i)AN(j)| < n/4. The set X = V\(N(i)AN(j)) does not distinguish i and j, 
and hence by rue rk(G[X]) <r-1. 

The rest of the proof is a somewhat lengthy argument combining graphs and 
matrices; geometric representation has done its job. We want to find an induced 
subgraph that is twin-free and lower rank, so that we can apply induction. 

Let G[Z] be a largest induced subgraph of G with rk(G[Z]) < r—1. Since X 
induces such a subgraph, we have 


3 
(9.2) |Z| > |X| > Gn. 
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If G[Z] is twin-free, then by the induction hypothesis applied to G/Z], 
4 4 
n<SlZ|< Sfr-1) < f(r). 


So we may assume that G[Z] has twins. By Lemma|9.3} we have the stronger bound 
rk(G[Z]) < r—2. 

Let u€ V\Z. By the maximality of Z, we must have rk(G[ZU {u}]) =r, and 
hence G[ZU{u}] must be twin-free. This implies that u must distinguish every 
twin pair in Z; in other words, u is connected to exactly one member of every such 
pair. This in particular implies that G[Z] does not have three mutually twin nodes. 
Let T be the set of nodes that are twins in G[Z] and are adjacent to u, and let T’ 
be the set of twins of nodes of T. Let U = Z\(TUT"). 

Next consider any v € V\ Z different form u. We claim that it is either con- 
nected to all of nodes in T and none in 7”, or the other way around. Row v of Ag 
is a linear combination of rows corresponding to Z and u; let, say, the coefficient of 
row wu in this linear combination be positive. Then for every twin pair {i, 7} (where 
i€ T and j € 7”) we have ay; > ayu;, but (by the definition of twins) ay; = aw; for 
all w € Z, and hence a,; > dy;. This means that v is adjacent to i, but not to J, 
and so we get that v is adjacent to all nodes of T but not to any node of T’. 

Thus we have a decomposition V \Z = YUY’, where every node of Y is con- 
nected to all nodes of T but to no node of T’, and for nodes in Y’, the other way 
around. So G has the structure in Figure |9.1] 


FIGURE 9.1. Node sets in the proof of Theorem [9.1] Dashed ar- 
rows indicate twins in G[Z]. 


The graph G[[UU] is obtained from G[Z] by deleting one member of each 
twin-pair, hence it is twin-free. Since rk(G[TUU]) = rk(G[Z]) < r—2, we can 
apply the induction hypothesis to get 
(9.3) |Z|+|U| < f(r—2). 

If |YUY"| < 16, then 

n= 2|T|+|U|+|YUY"| < 2f(r—2)+16 = f(r), 
and we are done. So we may assume that (say) |Y’| > 8. Since the rows in Y’ 
are all different, this implies that they form a submatrix of rank at least 4. Using 
that in their columns corresponding to T these rows have zeros, we see that the 
matrix formed by the rows in Y’UT has rank at least rk(G[T])+4. This implies 
that rk(G[T]) < r—4. 
If G/T] is twin-free, then we can apply induction: 


(9.4) (aS Ce 
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Using (9.2), (9.3) and (9.4), we get that 
4 4 4 
n< 4iz|=4eiri+ wn < S¢0-2) +404) < FO. 


So we may assume that G[T] is not twin-free. Any twins in G/T] remain twins 
in G\U: no node in T’ can distinguish them (because its twin does not), and no 
node in YUY’ can distinguish them, as we have seen. So G\U is not twin-free, 
and hence rk(G\U) < r—2 by Lemma By the maximality of Z, we have 
|V\U| < |Z|. Using (9.3), this gives 


n < |Z|+|U| = 2|T|+2|U| < 2f(r—2) < f(r). 


The bound in Theorem |9.1]is sharp up to the constant. (This shows that even 
though the bound on the kissing number could be improved, this would not give 
an improvement of the whole argument.) Let us define a sequence of graphs G, 
recursively: Let G,; = Ko, and given G, (r > 2), let G,41 be obtained from G, 
by doubling each node (creating |V(G,-)| pairs of twins), adding a new node, and 
connecting it to one member of each twin pair. It is easy to see by induction that 
G, is twin-free, |V(G,)| = 3-2"-!—1, and rk(Ag,) = 2r. 

As a corollary, we get that if the rank of the adjacency matrix of a simple 
graph (which may have twins) is r, then its chromatic number is at most f(r). This 
argument can be refined to prove a better upper bound of y(G) = 
O(r(4/3)")). It was conjectured that x(G) < r for every graph 
G, but this was disproved |Alon—Seymour 1989]; the worst example known so far 


Nisan—Wigderson 1984], gives a sequence of graphs with y(G) = (28 ae 


9.2. Regularity partitions 


9.2.1. Regularity Lemmas. In order to formulate this application of geo- 
metric representations, we have to include a little introduction to regularity lemmas. 
The original Regularity Lemma {Szemerédi 1975] ,[Szemerédi 1978] has been the key 
to many proofs in extremal graph theory, graph algorithms, number theory, the the- 
ory of graph limits, and more; the range of its applications is so broad that we do 
not attempt to describe any of them in this book. It has formulations not only in 
graph theory but in analysis and information theory as well, and its applications of- 
ten depend on different versions of the Lemma, which are not equivalent. Stronger 
versions have been proved {Alon at al. 2000], [Tao 2006], [LovaészSzegedy 2007]. 
Another version is weaker than the original but gives much better error bounds 
Frieze—Kannan 1993], and this is the version we will be dealing with in this sec- 
tion. 

Let G be a graph on n nodes. For two sets S,T C V, let eg(S,7) denote the 
number of edges 17 € E with i € S and j € T. We do not assume here that S and 
T are disjoint; edges induced by SNT should be counted twice. We also use this 
notation in the case when G is edge-weighted, when the weights of edges connecting 
S and T should be added up. (An unweighted edge can be considered as an edge 
with weight 1.) 

We come to an important definition of this section, introduced by 
Frieze-—Kannan 1993]. We define, for two edge-weighted graphs G and H on the 
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same set V of n nodes, their cut-distance by 


1 
dp (G, H) = po any Pate) en ST): 


This is, of course, not the only way to define a meaningful distance between two 
graphs; for example, the edit distance |E(G)AE(H)| is often used. For us the cut- 
distance, which measures a certain global similarity, will be more important. Note 
that the normalization guarantees that 0 < dg(G, H) < 1 for any two simple graphs 
G and H. It is straightforward to check that dp satisfies the triangle inequality. 


Remark 9.4. One may wonder why we divide by n?, and why not by |S||T'|, which 
would also guarantee that the distance of two simple graphs is at most one. It is not 
hard to see that in this case the maximum would be attained when |S$| = |T| = 1, 
and the distance would be 0 if the two graphs are identical, and 1 otherwise. 


Remark 9.5. The notion of the cut-distance can be modified to apply to graphs 
that are not defined on the same node set; they may even have different number of 
nodes. We don’t need these extensions in this book; see for a detailed 
discussion and applications. 


Let P = {Vi,...,Ve} be a partition of V into nonempty sets. We define the 
edge-weighted graph Gp on V by taking the complete graph and weighting its 
edge uv by eg(Vi, V;)/(|Vi] |Vj|) if u € Vi and v € V;. The case i = 7 takes some 
care: we then count edges twice, so eg(V;, V;) = 2|E(G[Vi])|, and we include the 
case u = v, so Gp will have loops. Informally, Gp is obtained by averaging the 
adjacency matrix over sets V; x Vj. 

The Regularity Lemma says, roughly speaking, that the node set of every graph 
has a partition P into a “small” number of classes such that Gp is “close” to G. 
Figure [9.2] shows a schematic explanation of the Regularity Lemma. 


FIGURE 9.2. Left: adjacency matrix of a graph G, where 1’s are 
replaced by black squares, and 0’s, by white squares. Right: sim- 
ilar representation of Gp for a 3-partition P. The partition is a 
regularity partition if the submatrix in (almost) every rectangle 
V; x V; is random-like. 


Lemma 9.6 (Weak Regularity Lemma). For every simple graph G and every 
k > 1, the node set V has a partition P into k classes such that 


4 
log 


d (G, Gp) < 


q 
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The main point here (and in subsequent related results) is that the error bound on 
the right tends to 0 as k > oo, independently of the graph. 

A proof of this lemma will follow from its matrix form (Lemma|9.7). We do 
not require here that P be an equitable partition. It is not hard to see, however, 
that this version implies that there is also a k-partition in which all partition classes 
are almost equal (in the sense that each class has |n/k| or [n/k] elements), and 
which satisfies the same inequality as in the lemma, just we have to double the 
error bound. 


9.2.2. Cut norm. It will be useful to reformulate the Regularity Lemma in 
terms of matrices, and for this, we need to define norms on the space of nxn 
matrices. For S,T C [n], we define the special matrix 1s, by 


1, ifteS andj eT, 
0, otherwise. 


(lsxr)i,j = 
This is a 0-1 matrix of rank 1. For a matrix M € R"*” and S,T C [n], we define 


M(S,T) = S- Mi j = M-Asxr. 
t€S,jET 


Now we come to defining various norms on matrices. We need the standard 


Frobenius norm: 
ibs 1/2 
Miz = (> M2,) 
i,j=1 
The cut-norm is less standard, and is used mostly in combinatorics. It is defined 
as follows: 


Milo = M1 = M(S,T)|. 
(9.5) \|.|| ne | sxT| max | (S,T)| 


It is easy to verify that this is indeed a norm (most importantly, it satisfies the 
triangle inequality). Applying this notation to the adjacency matrices of two graphs 
G and H on the same set of n nodes, we get 


1 
dg(G, H) = 72 lac — Aull 


There are many alternative ways to define the cut-norm, or norms that are 
closely related. (The proofs are left to the reader as exercises.) We could define it 
by 


(9.6) |M [lo = max{|a'My|: x,y € [0,1]"}. 


If we maximize instead over all vectors in [—1,1]”", we obtain a norm that may be 
different, but only by a factor of 4: 


(9.7) |M|lo < max{\|2'My|: x,y € [-1,1]"} < 4||M|| 


It will be useful to state the upper bound here in a form valid for all vectors 
z,yeER": 


(9.8) Jz"My| < 4l|.M lo [2100 [¥loo- 
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We can also play with the sets S and T in the definition, without changing the 
value too much. For example, 


1 
: _ < = 
(9.9) jlMllo < max M(S,8) < (Milo, 
1 
9.10 =||Mlo < M(S,T) < ||M 
(9.10) jlMllo < max, M(S,7) < [IMllo. 
1 
9.11 —||Mlo < M(S,T) < ||M 
(9.11) lilo < ,max , M(S,T) < |IM|| 


(The last formula is the origin of the name “cut norm”, since the edges counted 
form a cut in the graph.) 


9.2.3. Averaging. For a partition P = {Vi,...,Ve} of [n], vector x € R”, 
and matrix M € R”"*", we define the vector xp and matrix Mp € R"*” by 
r(Vi) M(Vi, Vj) 

Vi [Vil 1V5 

In words, the matrix Mp is obtained by averaging the entries of M over every block 
V, x V; (and analogously for xp). If A is the adjacency matrix of a simple graph 
G, then Ap is the weighted adjacency matrix of Gp. 

Let us say that B € R"*” is a P-matrix, if B is constant on every block V; x Vj, 
i.c., Bp = B. Then the operator M +> Mp is the orthogonal projection of R”*” 
to the linear space of P-matrices. This operator is symmetric and idempotent: 


Mp-N=M-Np=Mp-Np, (Mp)p = Mp. 


(rp )u _ and (Mp) u,v = (u € V;, VE Vj). 


With respect to many matrix norms, it is also contractive: 
(9.12) |Mp|l2<||Mll2, and ||Mpllo < ||| 
(The latter inequality takes some work to prove, see Exercise [9.5]) 


9.2.4. The Weak Regularity Lemma. Using the cut norm and this aver- 
aging operator, we can re-state the Weak Regularity Lemma. In fact, we state it 
in three versions, to make the proof easier to follow. 


Lemma 9.7. Let M € R"*”. 


(a) For every r > 1 there are 2r sets S,,...,5,,T,,...,T, © [n] and r real 
numbers a; such that 


1 
gran 
° 


|M—Sath cn 
i=1 vr 


(b) For every k > 2 there is a partition P of |n] into k classes for which 


|| Mll2- 


4 
|! —Mpllo < Vlog Mlle. 


(c) For every k > 2 there is a partition P of [n] into k classes, and a P-matrix 
B, for which 


|| M— BI| 


2 
< M 
Jick | ll2, 
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Lemma]9.7[a) gives a low-rank approximation of M. The important fact about 
version (a) of the lemma is that the number of terms r and the error bound 1/./r 
are polynomially related. The order of magnitude of the errors cannot be improved 


(see Exercise (9.10). 


Proof. We start with proving (a). Let S,T C [n] be two nonempty sets such 
that ||M||g = |M(S,T)|. For every real number a, we have 


||M—alsxr|l3 = ||Ml|? +47 ||Lsxrll3-2aM -1sxr 
= ||M||? +a°|S||T|—-2aM(s,T), 
which is minimized when a = M(S,T)/(|S||T|), and for this choice 


1 
[S| || 


Repeated application of this inequality gives a sequence of matrices 


(9.18) || —alsxr|l3 = |||? M(S,T)? < ||M|[3—||Mlla- 


M,=M—-)lailsxn, 
i=1 


where 
r-1 


MIS < MIB ->— 0G |l2- 
j=0 


Since the left side is nonnegative, there must be an index 7 < r—1 for which 


us 2 1 
MGI =|M—SPasdsn| < <M. 
i=1 
Replacing the remaining coefficients a;+1,...,@r—1 by 0, we get the decomposition 


in (a). 

Next we prove (c). Consider the decomposition as in (a) with r = |(logk)/2], 
and let P be the partition into the atoms of the Boolean algebra generated by all 
the sets S; and T;. Then |P| < 2?” < k, and the P-matrix B = S>\_, ails,x7, 
satisfies 


1 2 
|| — Bl S GpliMlle S eagle. 


(This last step holds for k > 4; for k = 2,3 we can just take B = 0.) 
To prove (b), we use (9.12) and the fact that Bp = B: 
|M—Mp|lo < IM — Blo +||B—Mplla = | M—Blla + || Bp — Mp|| 
= ||M@—-Bllo+||(B-™M)pllo < 2\|M—-B|| 


Remark 9.8. In (a), one could require in addition that 7, a? < 4, at the cost of 
increasing the error bound to (4/,\/r)||M||z. The trick is to use the maximizing sets 


in (9.10). 


Remark 9.9. The original Regularity Lemma measures the error of the approxi- 
mation A ~ Ap differently. Again there are several versions, closest to ours is the 
following: For a matrix WM € R”*” and partition P = {Vi,...,V,} of [n], and for 
every 1 < 1,7 < k, we choose the sets S;; C V; and T;; C V; maximizing the “local 
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error” |(M—Mp)(5j;,Tj;)|. The “total error” is the sum of these, normalized as 


before: 
k 


1 
e(P) = 5 S> |(M—- Mp) (Siz, Tiy)|- 
i,j=l 
Appropriate modification of the proof above gives the following: There is a partition 
P of [n] into k classes for which 


e(P) = o(F=4) 


(Here log* k is a very slowly decreasing function, defined as the number of times 
we have to apply the log function to & to get a negative number.) This bound 
is also known to be best possible up to a constant factor [Conlon—Fox 2012], 
[Fox-L MLovésr 3014). 


9.3. A-squared representation 


9.3.1. Voronoi cells and regularity partitions. Now we are ready to tie 
regularity partitions to geometric representations. We define the A-squared repre- 
sentation of a graph G as the map i> u,, where u; = Ae; is the column of A? 
corresponding to node i (where A = Ag is the adjacency matrix of G). Squaring 
the matrix seems unnatural, but it is crucial. We define the 2-neighborhood distance 
of two nodes by 


do(s, t) => 


73/3 |Ms — ual: 


This normalization guarantees that the distance of any two nodes is at most 1. We 
need further notation: For a nonempty set S C V, we consider the average distance 
from S: , i 
d2(S) = — do(z, 5) = — min dg(i, 7). 
a8) = 5 Yo dolt 8) = 5D minda(id) 
Example 9.10. To illustrate the substantial difference between the vector repre- 
sentations by the columns of the adjacency matrix and by the columns of its square, 
let us consider a random graph with a very simple structure: Let V = V, UV2, where 
|Vi| = |V2| = n/2, and let any node in V; be connected to any node in V2 with 
probability 1/2. With high probability, the distance between any two columns of 
the adjacency matrix is of the order \/n (approximately \/n/V/2 for two nodes in 
different classes, and ,/n/2 for two nodes in the same class). But if we square 
the matrix, the distance of two columns in different classes will be approximately 
n/?/8, while for two columns in the same class it will be O(n>/4). With the nor- 
malization above, the two classes will be collapsed to single points (asymptotically), 
but the distance of these two points will remain constant. So the 2-neighborhood 
distance reflects the structure of the graph very nicely! 4 


Example 9.11. Let G be obtained by selecting n random points on the d- 
dimensional unit sphere S¢, and connecting two of these points 2 and y if 
X(x,y) < 90°. Then with high probability do(r,y) ~ Vd£(a,y). So, even if 
we don’t use any information except the graph structure, the 2-neighborhood dis- 
tance of nodes is, up to a little perturbation tending to 0 as n > ov, is proportional 
to the geometric distance of the original points on the sphere. In other words, the 
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A-squared representation is the very high dimension n is almost isometric with the 
d-dimensional representation we started with. 

One may object that this is due to the very special construction of the graph. 
However, we can modify it an connect any two points depending on their distance, 
just making sure that we will have an edge density bounded away from both 0 and 
1. 

Another objection can be that the same conclusion would hold (with an even 
simpler proof) for the 1-neighborhood distance. We can further generalize the con- 
struction, by connecting two nodes randomly, where the probability of connecting 
them is an arbitrary continuous function of their distance. (Decisions about con- 
necting pairs of nodes must be made independently, and as before, we have to 
choose this function so that we get an edge density bounded away from 0 and 1.) 
In this case the 1-neighborhood distance between two nodes will be of constant 
order even if the original points are close on the sphere. 

This graph can be used to show that the bound in the Weak Regularity Lemma 
[9.6] is sharp, up to a constant factor. This will follow from Theorem b) below, 
noticing that for any subset S of k = 2% points on $%, at least half of the surface 
will be at a distance larger that 1//d from S. 4 


The following connection between Voronoi diagrams and regularity partitions 


was proved in |Lovasz—Szegedy 2007| (in a somewhat weaker form). 


Theorem 9.12. Let G be a simple graph, and let do(.,.) be its 2-neighborhood 
distance. 

(a) The Voronoi cells of a nonempty set S C V define a partition P of V such 
that dg(G, Gp) < 2d2($)'/?. 

(b) For every partition P = {V,,...,Vx} we can select elements s; € V; so that 
S ={s1,..., 8%} satisfies dy(S) < 4dg(G, Gp)'/?. 


Proof. In both parts of the proof we work with linear algebra, using the 
adjacency matrix A = Ag. In both parts we consider a particular partition 
P = {V.,...,Ve}. We will be interested in the “error” matrix R = A-—Ap, for 
which ||R||p = n?dp(G, Gp). 


(a) Let S = {s1,...,5,}, let P = {Mi,...,Vi} be the partition of V defined 
by the Voronoi cells of S' (where s; € V;), and let ¢(v) = s; for v € V;. To bound 


|| Rl, we use (9.9): 


[Rilo = max a" Ry}. 
x, ye{O,1}¥ 


Let x,y be the maximizers on the right, and let f = x—xp and y = y—yp. The 
crucial equation is 


a'Ry =a'Ay—a'Apy = x2'Ay—axpAyp = «'AG+@' Ayp, 
which implies that 
(9.14) Jz" Ry| < |2||AG| + ly] |Az| < n/? (|Az|+| Gl). 


To connect with the A-squared representation u, we use that 


A’g = S- Yu Uy — S- Yu (Uy —Ug(v)) 
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(since )7 cy, Jv = 0 for every partition class V;). The fact that we have a Voronoi 
partition means that |u, —ugi)| = n°/?de(v, 9) for every node v. Hence 


[A291 < 7 [Goll —tgQ)! < So ay — Ug] = 22/2 So da(v, 8) = n°/7GH(S), 
and 
JAG)? = 97A°G < |g] |A7G] < n¥/?n°/7da(S) = ndo(S). 
We get the same upper bound for | AZ]. ee with (9.14), we get 


do(G, Gp) = loRyl < S 579 (|A@|+1AG1) < 2da(S)/?. 


= 


(b) Let i, 7 be two nodes in the same partition class of P, then Ape; = Ape,, 
and hence A(e;—e;) = R(e;—e,;). Thus 
|u;—u,| = |A?e; — A?e,;)| = |AR(e;—e,;)| < |ARe;|+|ARe,|. 


For every set V; € P, choose a point s; € V; for which |ARe,| is minimized over 
jEV, by j = 5, let S = {51,...,5,}, and let d(i) = 5; for ic Yj. Then 


Ju; — Ug(a)| < |ARe;|+|ARegi)| < 2|ARe;|, 


and using the Cauchy—Schwarz Inequality, 


(9.15) d3(S) = oan < ~ S- da(i, $8) = _ do las vg) 


1EeV ieV ieV 
1/2 
Sp D|AReil < (Do |ARel?) 
tEeV tEeV 


Here by (9.8), 
S| |ARe;|? = tr(RAAR) = tr(ARRA) = )“(e/ A)R(RAe;) 
iEV iEV 


<Al[Rllo D5 |Aeiloo|RAeiloo < 4n”|| RI 
tEV 


and so by (9.15), 


—_ 4 
da(S) < —||RIlq” = 4da(@, Gp)”. 


Combining Theorem with the Weak Regularity Lemma, it follows that 
every graph has an “average representative set” in the following sense. 


Corollary 9.13. For every simple graph G and every k > 1, there is a set SCV 
of k nodes such that d2(S) < 8/(logk)!/4. 


Remark 9.14. In |Lovasz 2012), a version of Theorem |9.12] was formulated using 
the £;-distance between the representing points rather than the 2 (Euclidean) 
distance. This gives a sharper result in the sense that we don’t have to take the 
square root in (b); as a consequence, we get 4/(logk)!/? instead of 8/(logk)'/4 in 
Corollary [9.13] On the other hand, the Voronoi diagram is much nicer, and more 
efficiently computable, for the Euclidean distance (recall F igure [2.10). 
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Remark 9.15. Theorem suggests to define the dimension of a family G of 
graphs as the infimum of real numbers d > 0 for which the following holds: for 
every € > 0 and G € G the node set of G can be partitioned into a set of at most 
e|V(G)| nodes and into at most e~? sets of da-diameter at most ¢. (This number d 
can be infinite.) In the cases when the graphs have a natural dimensionality, this 
dimension tends to give the right value (recall Example[9.11). 


Remark 9.16. It would be tempting to reverse the above treatment: prove Corol- 
lary [9.13] by some more direct geometric argument, and then use Theorem to 
prove the (weak) Regularity Lemma. However, a direct proof of Corollary|9.13]has 
been elusive so far. 


Remark 9.17. Does Corollary [9-13]remain valid if we want to find a set of k nodes 
such that the maximum distance from the set, rather than the average distance, 
can be bounded uniformly by a function of k tending to zero as k — oo? Alon 
(unpublished) proved that the answer is in the affirmative, if we consider the ¢; 
distance instead of the Euclidean distance in the A-square representation, and allow 
a slightly weaker bound of O((loglogk)/,/logk) for the distance from the k-set 
(see for a proof). It is not known at this time whether an analogous 
statement remains true for the Euclidean distance. Again, a geometric proof would 
be very desirable. 


9.3.2. Algorithms for nets and regularity partitions. The results above 
can be applied to the computation of (weak) regularity partitions in the property 
testing model for dense graphs. This topic is treated in detail in [Lovasz 2012), so 
we only sketch the model and the application. 

We need to survey some notions and elementary arguments about packing and 
covering. Let ¢ > 0. A set of nodes S C V is an ¢€-cover, if do(v,S) < © for every 
point v € V. (Here, as usual, do(v,S) = mingegdo(s,v).) We say that S is an 
e-packing, if do(u,v) > € for every u,v € S. An e-net is a set that is both an 
e-packing and an ¢-cover. It is clear that a maximal ¢-packing must be an €-cover 
(and so, and e-net). It is clear that if S is an e-cover, then the Voronoi cells of S$ 
have diameter at most 2e. 

An average €-cover is a set S C V such that >) ¢y da(v,S) < en. An average 
e-net is an average (2¢)-cover that is also an ¢-packing. (It is useful to allow this 
relaxation by a factor of 2 here.) For every average €-cover, a maximal subset that 
is an €-packing is an average e-net. Corollary [9.13] implies that every graph has an 
average €-net with K(e) = 20(/©") elements. 

We will need the following simple lemma. 


Lemma 9.18. Let T,S CV. Then there is a subset S’ C S such that |S’| < |T| 
and d(S’) < d(S)+2d(T). 


Proof. For every point in T choose a nearest point of S, and let S’ be the 
set of points chosen this way. Clearly |S’| < |T|. For every x € V, let ye S 
and z € T be the points nearest to x. Then do(z,S) < de(z,y) < de(x,z)+ 
do(a,y) = de(a,T)+de(x,S), and hence, by its definition, S’ contains a point y’ 
with do(z,y’) < do(#, T) +de(x, S$). Hence 


do(x, S") < do(a,y’') < do(a, z)+do(z,y’) < 2do(x,T) +do(a, S). 


Averaging over x, the lemma follows. 
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Now we turn to algorithms. We assume, as before, that we can generate inde- 
pendent, uniformly distributed random points from V. The 2-neighborhood metric 


a 1 1 
da(i, ) = 51A*(ei—es)| = <q D(A — (A) in 
k 


1 

ne Ye — Ajr) Ark 
k r 

(where A is the adjacency matrix of the graph G) can be written as 


do(i, 2) =Ex | E,. ((Aér _ Ajr) Ark) i: 
where & and r are chosen randomly and uniformly from V. This shows that we 
can approximately compute do(i, 7) by random sampling from V. (We suppress the 
analysis using Laws of Large Numbers and concentration inequalities. ) 

How to construct e-nets? If V is not too large, then we can go through the nodes 
in any order, and build up the e-net S, by adding a new node v to S if do(v, S) >. 
In other words, S' is a greedily selected maximal ¢-packing, and therefore an e-net. 

Average e-nets can be constructed by a simple randomized algorithm even if V 
is large. We build up S randomly: at each step we generate a new random node, 
and add it to S if its distance from S is at least ¢ (otherwise, we throw it away). 
For some pre-specified A > 1, we stop if for A/e consecutive steps no new point 
has been added to S. The set S formed this way is trivially an ¢-packing, but not 
necessarily maximal. However, it is likely to be an average (2¢)-cover. Indeed, let t 
be the number of points « with dg(z,S) >. These points are at distance at most 
1 from S$’, and hence 


2 


S¢ do(x, S) < (nthe +t < nett. 

xEeV 
So if S is not an average (2¢)-cover, then t > en. The probability that we have not 
hit this set for A/e steps is 


The time cost of this algorithm is at most K(e)A/e. 

We can modify this procedure to find an almost optimal average O(¢)-cover. 
Consider the smallest average e-cover T (we do not know which points belong to 
T, of course). Suppose that we can construct a (possibly much larger) average 2e- 
cover S by the procedure described above. By Lemma]9.18] S contains an average 
(3¢)-cover S$’ of size |.S”| < |T]. 

How to find this subset of S? We can try all |T'|-subsets of S. This clearly 
inefficient last step is nevertheless useful: In 20(*(©)) (randomized) time, we can 
construct a set that is an average (3¢)-cover with probability 1—e. (The point is 
that the time bound depends only on ¢, not on the size of V.) 

Once this set S is computed, we can consider the task of computing a weak 
regularity partition solved: by Theorem |9.12| we can take the Voronoi cells of the 
set S, with respect to the metric d, as the classes of this partition. This partition 
P satisfies dg(G,Gp) < 81/2. 

Note, however, that “computing the partition P” does not mean that we com- 
pute a list of all nodes, specifying the partition class it belongs to: we assume that 
the graph is very large, so such a list would be too long. What we want is a way to 
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determine about any given node which partition class it belongs to, in other words, 
which element of S' is closest to it. Since the metric d is computable by sampling, 
this can be done in the property testing model. (Of course, using sampling we can 
only compute an approximation of the metric d, with high probability. We do not 
discuss the issue of estimating the errors here.) 


Remark 9.19. The considerations in Section|9.3.2|work in general for every metric, 
assuming that we have an upper bound on the size of an ¢-packing. 


Exercise 9.1. Find all twin-free simple graphs with rank at most 4. 

Exercise 9.2. Show that for every even r > 2, there is a twin-free simple graph 
on 2”~+ nodes whose adjacency matrix has rank r over GF(2). 

Exercise 9.3. Prove that every graph on n nodes contains two nodes i and j 
with |N(i)AN(j)| < n/2. Show that this bound is best possible. 

Exercise 9.4. Let Gi and G2 be two independently generated random graphs on 
the same set V of nodes, obtained by connecting any two elements of V with prob- 
ability 1/2, independently for different pairs. Prove that with high probability, 
do(G1, G2) = o(1). 

Exercise 9.5. Prove (9.12). Prove that similar inequality holds for any matrix 
norm that is invariant under reordering of the rows and columns. 

Exercise 9.6. Let M be a real nxn matrix, and define ||M||1 = 77 j=. |Mis|- 
(a) Prove that ||M||1 < 2n||M||g. (b) Prove that ||M||1 < 2./n||M|| 
Exercise 9.7. If a metric space has an ¢-cover with N elements, then no (2<)- 
packing can have more than N elements. 


Exercise 9.8. Every ¢-cover of a metric space has a subset that is both an e- 
packing and a (2¢)-cover. 


Exercise 9.9. Let (V,d) be a finite metric space and let T’ and S be e-covers. 
Then there is a subset 9” C S such that |S’| < |T'| and S’ is a (4e)-cover. 


Exercise 9.10. Let S@ denote the d-dimensional unit sphere. Let V be a random 
subset of S* with |V| = n, where we let n — oo in the statements below. Connect 
two points of V if their spherical distance is larger than 7/2, to get a graph G. (a) 
Prove that with high probability, (A%)uo ~ <(u, v)/2m for all u,v € V. (b) Prove 
that with high probability, do(u,v) = Q(4(u,v)/Vd) for all u,v € V. (c) Prove 
that with high probability, every set S C V with d(S) < 1/(8Vd) has at least 
2¢ elements. (d) Prove that with high probability, every partition P for which 
do(G, Gp) < 1/(64Vd) has at least 2% classes. 


Exercise 9.11. Describe how to solve the maximum cut problem in the property 
testing model in the following sense: There is a randomized algorithm A which, 
given a graph G, an error bound € > 0, and a node v € V, returns LEFT or 
RIGHT. For a fixed G and e, the partition into nodes found on the LEFT and 
nodes found on the RIGHT determine a cut containing at least (1—¢) times as 
many edges as the maximum cut. All this happens with probability 1—¢, and the 
computation time is bounded by a function of e. 


CHAPTER 10 


Orthogonal Representations: Dimension 


We have used orthogonal representations, (labelings of the nodes by vectors in 
which nonadjacent nodes are labeled by orthogonal vectors of graphs) in Section 
of the Introduction. In this and the next chapter we study them from two 
different aspects. One can be concerned with the smallest dimension in which 
such a representation exists, or with representations that are “economical” in some 
other sense. It turns out that orthogonal representations are related to a number 
of fundamental properties of graphs, like connectivity, tree-width, stability number 
and chromatic number. 

Perhaps the most natural way to be “economic” in constructing an orthogonal 
representation is to minimize the dimension, which will be the main topic of this 
chapter. We can say only a little about the minimum dimension of orthogonal 
representations overall, but we get interesting results if we impose “nondegener- 
acy” conditions. We will study three nondegeneracy conditions: general position, 
faithfulness, and transversality. 


10.1. Basics 


An orthogonal representation of a simple graph G in R?@ assigns to each i € 
V a vector u; € R®@ such that usu; = 0 whenever ij € E. An orthonormal 
representation is an orthogonal representation in which all the representing vectors 
have unit length. Clearly we can always scale the nonzero vectors in an orthogonal 
representation to unit length, and zero vectors (which are orthogonal to everything) 
can be either excluded or ignored in most cases. 

There is something arbitrary in requiring that nonadjacent nodes be represented 
by orthogonal vectors; why not adjacent nodes? We use here the more standard 
convention, but sometimes we need to talk about an orthogonal representation of 
the complement, which we also call a dual orthogonal representation. 

Note that we did not insist that adjacent nodes are mapped onto nonorthogonal 
vectors. If this condition also holds, then we call the orthogonal representation 
faithful. 


Example 10.1. For d = 1, the vector labels are just real numbers u;, and the 
constraints uju; = 0 (ij € E) mean that no two nodes labeled by nonzero numbers 
are adjacent; in other words, the support of u is a stable set of nodes. Scaling the 
nonzero numbers to 1, we obtain incidence vectors of stable sets. (The connection 
between stable sets and orthonormal representations in higher dimension will be 
the central topic of the next chapter.) 

Since very simple problems about stable sets are NP-hard (for example, their 
maximum size), this example should warn us that orthogonal representations can 
be very complex. ¢ 
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Example 10.2. Every graph has a trivial orthonormal representation in RY, in 
which node i is represented by the standard basis vector e;. This representation 
is not faithful unless the graph has no edges. However, it is easy to perturb this 
representation to make it faithful. Of course, we are interested in “nontrivial” 
orthogonal representations, which are more “economical” than this trivial one. 


Example 10.3. Every graph G has a faithful orthogonal representation in R®, 
in which we label a node by the indicator vector of the set of edges incident with 
it. It is perhaps natural to expect that this simple representation will be rather 
“uneconomical” for most purposes. 4 


Example 10.4. Figure below shows a simple orthogonal representation in 2 


dimensions of the graph obtained by adding a diagonal to the pentagon. 4 
b a,b,e 
c 
a ——-> 
d 
c,d 
e 0 


FIGURE 10.1. An (almost) trivial orthogonal representation 


Example 10.5. The previous example can be generalized. In an orthogonal rep- 
resentation, we can label a set of nodes with the same nonzero vector if and only if 
these nodes form a clique. Let k = y(G) (the chromatic number of the complement 
of G), then there is a family {B,,...,B,} of disjoint complete subgraphs covering 
all the nodes. Mapping every node of B; to e; (¢ = 1,...,k) is an orthonormal 


representation. 4 


Example 10.6. In the introduction we have seen an orthogonal representation with 
a more interesting geometric content. The previous example gives an orthogonal 
representation of C5 in 3-space (Figure [10.2] left). The “umbrella” representation 
defined in the introduction gives another orthogonal representation of the graph Cs 
in 3-space. 4 


d c 


FIGURE 10.2. The graph C; and two orthonormal representations 
of it. 
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Example 10.7. Let L be the Laplacian of a graph G, and let u; (i € V) be the 
columns of the matrix L'/?. Then usu; = L;;, and hence u is a faithful orthogonal 
representation of G. 4 


10.2. Minimum dimension of orthonormal representations 


Our first topic is to construct orthogonal representations in a dimension as low 
as possible. Of course, we can represent every node by the zero vector, to get an 
orthogonal representation in dimension 0. So we come to a very weak nondegeneracy 
assumption: we exclude zero vectors. Nonzero vectors can be normalized to unit 
vectors, so in other words, we want to study the minimum dimension in which a 
given graph G has an orthonormal representation. It is trivial that this dimension 
is at least a(G), since we have to use at least a(G) mutually orthogonal vector 
labels. The construction in Example shows that this minimum dimension is 
at most x(G). 

The minimum dimension of an orthonormal representation is (loosely) tied to 
the chromatic number of the complement. Let us consider the (infinite) graph 
Ha, whose nodes are all unit vectors in R¢, two of them being adjacent if and 
only if the vectors are orthogonal. The chromatic number of this graph is not 
known precisely, but (for large d) rather good bounds follow from the results 
of (upper bound) and [Frankl-Wilson 1981], improved by 
. Not stating the most precise bounds, the following will suffice 


for us: 
(10.1) 1.27 < y(Hy) < 4%. 
Using this, it will be easy to prove the following. 


Proposition 10.8. The minimum dimension d in which a graph G has an or- 
thonormal representation satisfies 


slosx(@) <d< x(@). 


Proof. We have already shown the upper bound. On the other hand, any 
orthonormal representation of G in R¢% gives a homomorphism G —> Hy, and hence 


x(G) < x(Ha) < 4%, 


proving the lower bound in the proposition. 


The bounds above are essentially tight. The theorem of |Erd6s—de Bruijn 1951 


implies that Hq has a finite subgraph H with y(H) = x(Ha) > 1.2%, showing that 
the lower bound is tight up to a constant factor. The graph with d nodes and no 
edges shows the tightness of the upper bound. 

Stated in a different language, proved that no triangle-free 
graph with n nodes can have an orthonormal representation in dimension less than 
n/2 (this is clearly tight, for example, for a complete bipartite graph; see Exercise 
[10.3). The case of general clique number was considered in 
and [Alon-Szegedy 1999], but the upper and lower bounds are rather far apart. 
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10.3. General position and connectivity 


The first nontrivial nondegeneracy condition we study is general position: we 
assume that any d of the representing vectors in R@ are linearly independent. This 
implies that neither one of them is the zero vector. Another way of expressing this 
property is that every subset of the vectors is either linearly independent or spans 
the whole space. 

The main result to be presented in this section 
finds an exact condition for this type of geometric representability. 


Theorem 10.9. A graph G has an orthogonal representation in R¢ in general 
position if and only if G is (n—d)-connected. 


It is useful to remark that a graph is (n— d)-connected if and only if its comple- 
ment does not contain a complete bipartite graph with more than d nodes (where 
for a complete bipartite graph we always assume that its bipartition classes are 
nonempty). So we could formulate this theorem as follows: A graph has a dual or- 
thonormal representation in general position in R® if and only if it does not contain 
any complete bipartite subgraph with d+1 nodes. 

The condition that the given set of representing vectors is in general position is 
not easy to check (it is NP-hard). A weaker, but very useful condition will be that 
the vectors representing the nodes nonadjacent to any node are linearly indepen- 
dent. We say that such a representation is in locally general position. This condition 
implies that every node is represented by a nonzero vector unless it is connected to 
all the other nodes. In this case there is no condition on the representing vector, 
sO we may assume that all vectors are nonzero (equivalently, unit vectors). 

Theorem [10.9] will be proved in the following slightly more general form: 


Theorem 10.10. For every graph G, the following conditions are equivalent: 
(i) G has an orthogonal representation in R@ in general position; 
(ii) G has an orthogonal representation in R® in locally general position; 


(iii) G is (n—d)-connected. 


Two steps of the proof are easy. Consider an orthogonal representation in 
general position, then the vectors representing the non-neighbors of a node v cannot 
span the whole space (since they are all orthogonal to the vector label of v), so 
they are linearly independent. Thus (i)= (ii). To illustrate the connection between 
connectivity and orthogonal representations, we prove that (ii)=(iii). Let Vo be a 
cutset of nodes of G, then V = VoUV, UVa, where Vi, V2 4 0, and no edge connects 
VY, and V2. This implies that the vectors representing V; are linearly independent, 
and similarly, the vectors representing V2 are linearly independent. Since the vectors 
representing V, and V2 are mutually orthogonal, all vectors representing V, U V2 are 
linearly independent. Hence d > |V,; UV2| = n—|Vol, and so |Vo| > n—d (Figure 
10.3). 

The difficult part of the proof will be (iii)=(i): this requires the construction 
of a general position orthogonal representation for (n—d)-connected graphs. We 
describe and analyze the algorithm constructing the representation. As a matter 
of fact, to describe the construction is quite easy, the difficulty lies in the proof of 
its validity. 
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FIGURE 10.3. Locally general position orthogonal representation 
in low dimension implies high connectivity. 


10.3.1. G-orthogonalization. The following procedure can be viewed as an 
extension of the Gram-Schmidt orthogonalization algorithm. Let G be any simple 
graph, where V = [n], and let u: V — R?@ be any vector labeling. Let us choose 
vectors f;, fo,... consecutively as follows. Let fj = u,. Supposing that the vectors 
f; (i < j) are already chosen, we define f; as the orthogonal projection of u; onto 
the subspace L; = lin{f; : i < j, ij € E}+. The sequence (fi,...,f,) is a vector 
labeling of G, which we call the G-orthogonalization of the vector labeling u. It is 
trivial that f is an orthogonal representation of G. If E = 0, then (f,,...,f,.) is just 
the Gram-Schmidt orthogonalization (u1,...,u,). It follows by straightforward 
induction on 7 that 

lin{uy, eas ,u,} = linff;, ear ,f} 
for every j € [n]. 

It is important to note that the G-orthogonalization of a vector labeling depends 
on the ordering of V. In this section we will be concerned with G-orthogonalizations 
obtained through different orderings of a fixed graph G. Ifo: V — [n] defines 
an ordering of the nodes and u: V -> R?@ is a vector labeling, then we denote its 
G-orthogonalization by f% (we omit o if the ordering of the nodes is understood). 

From now on, we consider a random vector labeling u: by this we mean that 
the vectors u; are random vectors whose entries are chosen independently from the 
standard Gaussian distribution. Note that the length of the vector u; influences 
the vector f; only, and only its length; so we could replace the (multidimensional) 
standard Gaussian distribution of u; by any rotationally symmetric distribution in 
the lemmas below (for example, by the uniform distribution on S¢~*). 


Lemma 10.11. Let G be a graph in which every node has degree at least n—d, 
and let u: V —+ R®@ be a random vector labeling. Then the vectors in the G- 
orthogonalization of u (with respect to any ordering of the nodes) are almost surely 
nonzero. 


Proof. For any j, there are at most d—1 vectors f; (i < 7, i7 ¢ E), hence they 
do not span R?, and hence almost surely u, is linearly independent of them, and 
then its projection onto L; is nonzero. 


The main fact we want to prove is the following (which implies Theorem|10.10 
immediately). 


Lemma 10.12. Let G be an (n—d)-connected graph, and let u: V — R® be 
a random vector labeling. Then the vectors in the G-orthogonalization of u (with 
respect to any ordering of the nodes) are almost surely in general position. 


156 10. ORTHOGONAL REPRESENTATIONS: DIMENSION 


To motivate our approach to the proof, let us consider a simple example. 


Example 10.13. Let V = {a,b,c,d} and EF = {ac, bd}. Consider a random vector 
labeling u in R® and compute its G-orthogonalization f, associated with the given 
ordering. Since every node has degree 1, the vectors f; are almost surely nonzero. 
We have f, = Ua, f, € f+, and f, € f+; almost surely, f, will not be parallel to 
f,,, so together they span the plane f;+. This means that fy, which is orthogonal to 
both f, and f., must be parallel to f,, no matter what ug is. 

Now suppose that we reverse the order of the nodes. Let fi, f', ff, f/ be the 
vectors obtained by the G-orthogonalization in this order. Almost surely f’, will not 
be parallel to ff, but f/ will be parallel with f/. So not only are the two distributions 
different, but a particular event, namely f,||fz, occurs with probability 0 in one and 
with probability 1 in the other. 

Let us modify this example by connecting b and c by an edge. Processing the 
vectors in the order (a,b,c, d) again, the vectors f, = ug and fy, will be orthogonal 
again. No condition on f,, so almost surely f. = u, will be linearly independent 
of f, and f,, but not orthogonal to either one of them. The direction of fg is 
still determined, but now it will not be parallel to f,; in fact, depending on fi, it 
can have any direction orthogonal to f,. Doing this in the reverse order, we get 
vectors f), f/, ff, f, that have similar properties. The distributions of (fu, fc, fy, fa) 
and (f%, f/, ,£,) are still different, but any event that occurs with probability 0 in 
one will also occur with probability 0 in the other (this is not quite obvious). 


This example motivates the following considerations. As noted, the distribution 
of the G-orthogonalization may depend on the ordering of the nodes; the key to the 
proof will be that this dependence is not too strong. To define what this means, 
consider two random variables X and Y with values in R™. We say that these are 
mutually absolutely continuous, if for every Borel set B, 


P(xeB)=0 s= P(Y Ee B)=0. 


Informally, if a “reasonable” property of X almost never holds (for example, its 
coordinates almost surely do not satisfy an algebraic equation), then this is also 
true for Y, and vice versa. 

We will repeatedly use the following easy property of being mutually absolutely 
continuous. 


Lemma 10.14. Consider two random variables, each of which consist of two com- 
ponents: X = (U,V) and X'’ = (U',V’), where U,U' € R™ and V,V' € RY. 
Assume that U and U’ are mutually absolutely continuous, and for every Borel set 
A €R™ with P(U € A) > 0, the random variables V|A and V'|A are mutually 
absolutely continuous. Then X and Y are mutually absolutely continuous. 


The main lemma (which will be used later as well) is the following. 


Lemma 10.15 (Main Lemma). Let G be an (n—d)-connected graph, and let 
u: V > R? be a random vector labeling. Then for any two orderings o and tT of 
V, the random variables £° and £7 are mutually absolutely continuous. 


This implies Lemma}10.12} since the first d vectors in the G-orthogonalization 
of a random vector labeling are almost surely in general position, and we can start 
the ordering with any d-tuple. 
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Proof. It suffices to prove this lemma in the case when 7 is the ordering 
obtained from o by swapping the nodes in positions 7 and j+1 (1 < 7 < n—1). 
Let us label the nodes so that o = (1,2,...,n). 

Clearly f? = f7 for 1 < j. The main part of the proof is to show that the 
distributions of (f7,...,£7,,) and (f7,...,£7,,) are mutually absolutely continuous. 
We prove this by induction on j, distinguishing several cases. 

Case 1. j and j+1 are adjacent in G. In this case the vector f;,; does not 
depend on uj; and vice versa, so f? = f7 and f7,, = f7,. 

Case 2. j and j+1 are not adjacent, but they are connected by a path that 
lies entirely in {1,...,7,7+1}. Let P be a shortest such path and let t be its length 
(number of edges), so 2 < t < j. We argue by induction on ¢ (and on j). Let i be 
any internal node of P. We swap j and j+1 by the following steps (Figure}10.4): 
1) Interchange 7 and j, by several swaps of consecutive elements among the 
first 7. 
2) Swap i and j+1. 

3) Interchange j+1 and j, by several swaps of consecutive elements among 
the first 7. 
4) Swap j and i. 


5) Interchange j +1 and i, by several swaps of consecutive elements among the 
first 7. 


oOo 0 Oo Oo e) oOo 0 
i j jtl 
oOo 0 Oo Oo (e) oOo 0 
j i j+l 
Oo Oo O Oo Oo e) oOo 0 
j jtl i 
Oo Oo 0 Oo Oo o) oo Oo 
j+l j i 
oO 0 Oo Oo (e) 00 0 
jt+l i j 
Oo Oo 0 Oo Oo o) oo Oo 


i jtl j 
FiGurRE 10.4. Interchanging j and j+1. 


In each step, the new and the previous distributions of the G-orthogonalized 
vectors are mutually absolutely continuous: In steps (1), (3) and (5) this is so 
because the swaps take place among the first 7 nodes, and we can invoke induction 
on j; in steps (2) and (4), because the nodes swapped are at a smaller distance 
than ¢ in the graph distance, and we can invoke induction on t. 


Case 3. There is no path connecting j to 7 +1 within {1,...,j+1}. This case 
is more tedious but not particularly deep. It suffices to show that the distributions 
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of the pairs of vectors (£7, £7, ,) and (£7, 7,1), conditioned on the previous vectors 
Uj,-..,U;_1, are mutually absolutely continuous. The remaining vectors f? and ff 
(k > 7+1) are generated in the same way from these two sequences, and hence the 
distributions of f7 and f* are mutually absolutely continuous. 

Clearly V \ {1,...,7+1} is a cutset, whence by our hypothesis on the connectiv- 
ity of G it follows that n—j—1>n—dandsoj <d—1. Let B=lin{u,...,u,;_i} 
and C = B+, then t = dim(C) = d—dim(B) > d—(j—1) > 2. 

The random vector u; can be decomposed as u; = b; + ¢;, where b; and c; are 
random Gaussian vectors in B and C, respectively, and b; and c; are independent 
as random variables. The subspace which is orthogonal to all previous nonneighbors 


of j (in the ordering a) is L; = (N” (7). Thus C C Lj. Let bj, be the orthogonal 
projection of b; onto L;. Since b; is orthogonal to C, so is bj, thus bj € B. 

We define Lj41 similarly for the node 7+1 and ordering 7, along with the 
decomposition uj+1 = bj41+¢j41, and the orthogonal projection b/,, of bj+1 
onto L. 

The following observation will be important: 


(10.2) BEL bhi: 


Here we have to use that there is no path in J connecting 7 and 7+1. This 
implies that the set {1,...,7-—1} has a partition W’UW” so that N7(j) C W’, 
N7(j+1) C W”, and there is no edge between W’ and W”. Let B’ = lin(W’) and 
BY = lin(W"), then B’ 1 BY, and L; 1 B”, which implies that bi; € B’. Similarly, 
bj.,, € B”, which implies (10.2). 

To get £7, we have to project u; onto L;; using that C C Lj, the component 
of u; in C remains unchanged, and this projection can be expressed as 


(10.3) fy = bj +c; 
Similarly, we have 


(10.4) fia = bi +Cj41- 


To describe the other two orthogonalized vectors, notice that the only difference 
between f7,, and f7,, is that we have to make f?,, orthogonal to f? as well. Since 
the vectors bi, and bj,, are already orthogonal to each other, as well as to all 
vectors in C’, this simply means that we have to modify by replacing ¢j+1 


by its projection co 4, onto the orthogonal complement of cj. So we can write 


(10.5) fy = bhi tej 
Similarly 
(10.6) f7 =bi+c)}, 


where ci is the projection of c; onto the orthogonal complement of c;+1. 

We can generate these vectors as follows. We condition on the vectors 
uj,...,U;-1, determining the subspaces B and C’. Then we generate indepen- 
dent Gaussian vectors b;,b;+1 € B, determining the vectors bj, and bj, ,. Finally, 
we generate two independent Gaussian vectors c;,c;,1 € C. Both pairs of vec- 
tors (cj,c;,,) and (cj,c;41) belong to the manifold {(x,y): x,y € R?, x ly}, 
and they have positive density functions, which implies that they are mutually 
absolutely continuous. Hence (f7,f7,,) and (f7,£7,,) are mutually absolutely con- 
tinuous. 


10.4. FAITHFULNESS 159 


This completes the proof of Lemma|10.15]}and of Theorem |10.10 


10.4. Faithfulness 


It is probably difficult to decide whether a given graph has a faithful orthogonal 
representation in a given dimension. In other words, we do not know how to 
determine the minimum dimension of a faithful orthogonal representation. What 
we can do is to give almost-trivial upper and lower bounds, and—as an application 
of the results in the previous section—a nontrivial upper bound. 

Let us start with some examples. 


Example 10.16. It is easy to construct a faithful orthogonal representation in 
dimension n—1 of the path P,, with n nodes: we label the nodes in order by e;, e; + 
€2,---,@n—2 +En—1,en—-1- The path P,, has no faithful orthogonal representation in 
lower dimension. Indeed, in any faithful orthogonal representation (v;: i € V(P,), 
for every 1 < k < n—1, the vector vz41 is orthogonal to v1,...,Vvz—1, but not 
orthogonal to vz, which implies that vz is linearly independent of vj1,...,Vx—1. 
Hence the vectors vj,...,Vn—1 are linearly independent, and so the ambient space 
must have dimension at least n—1. 4 


This example motivates the following upper and lower bounds on the dimension 
of a faithful orthogonal representation. We have constructed orthogonal represen- 
tations from cliques covering the nodes of a graph (Example [10.5). These repre- 
sentations are far from being faithful in general. However, using a family of cliques 
covering all edges, we can construct a faithful representation. 


Proposition 10.17. If the edges of a graph G can be covered by k cliques, then it 
has a faithful orthogonal representation in R*. 


Proof. Let {B,,...,B,} be any family of complete subgraphs covering all 
edges of a graph G. We construct the vector labeling u in R*, defined by (u;); = 
1(@i € B;). Then u is a faithful orthogonal representation of G: if i and j are 
adjacent nodes, then they are contained in one of the cliques B;, and hence uju,; > 
0; if they are nonadjacent, then the supports of u; and uj are disjoint, and so 
uj u; = 0. 


It is trivial that a(G) is a lower bound on the dimension of any orthogonal 
representation by nonzero vectors (faithful or not). To strengthen this bound in 
the case of faithful representations, we say that a subset S C V is almost stable, 
if for every connected subgraph H of G[S] with at least one edge there is a node 
j € V\V(A) that is adjacent to exactly one node in V(H). (The node j may or 
may not belong to S.) It is trivial that every stable set is almost stable. 


Proposition 10.18. The dimension of any faithful orthogonal representation of a 
graph G is at least as large as its largest almost stable set. 


Proof. Let v be a faithful orthogonal representation of G in dimension d, and 
let S be an almost stable set. We prove by induction on |S] that the vectors in v(S) 
are linearly independent. If S is a stable subset of V, then this is trivial. Else, let 
H be aconnected component of G[S] with at least one edge, and let 7 ¢ V(H) bea 
node adjacent to a unique node i € V(H). The set S’ = S\i is almost stable, and 
hence the vectors in v(.S") are linearly independent by the induction hypothesis. 
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Next, observe that the vectors v(V(H)\i) cannot span v;; this follows from 
the fact that v; is orthogonal to each of these vectors, but not to v;. Since the 
vectors vx, with k € S\V(#) are all orthogonal to every vector v;, with k € V(#), 
it follows that S’ cannot span v;. This proves that v(S') = v(S’)U {v;} is a linearly 
independent set of vectors. 


From the results in Section |10.3.1| it is easy to derive an upper bound on the 
minimum dimension of faithful orthogonal representations. 


Proposition 10.19. Every (n—d)-connected graph on n nodes has a faithful or- 
thogonal representation in R¢. 


Proof. It suffices to show that in a G-orthogonalized random representation, 
the probability that two adjacent nodes are represented by orthogonal vectors is 
zero. By the Lemma 10.15} it suffices to prove this for the G-orthogonalization 
based on an ordering starting with these two nodes. But then the assertion is 
obvious. 


Using that a graph with minimum degree D cannot contain a complete bipartite 
graph with more than 2D nodes, Proposition}10.19}implies the following bound. 


Corollary 10.20. If the maximum degree of a graph G is D, then G has a faithful 
dual orthogonal representation in 2D dimensions. 


This was proved in [Maehara—Rodl 1990). They conjecture that the bound 
on the dimension can be improved to D+1. Proposition shows that the 
conjecture is true if we strengthen its assumption by requiring that G is (n — D—1)- 
connected. 

We conclude this section with two examples illustrating the use (and insuffi- 
ciency) of the upper and lower bounds above. 


FicureE 10.5. The graph Vg and the triangular grid Ag. 


Example 10.21. Let Vg denote the graph obtained from the cycle Cg by adding 
its longest diagonals (Figure left). This graph is 3-connected, and hence by 
Proposition it has a faithful orthogonal representation in R°. It does not 
have a faithful orthogonal representation in R4 [van der Holst 1996]. But to show 
this is not easy, because it does not have an almost stable set of size 5. (The last 
two facts need some case analysis, not reproduced here.) So the lower bound in 
Proposition [10.18] does not always give the right value. 4 


Example 10.22 (Triangular grid I). Consider the triangular grid Ay, with k > 2 
nodes along the bottom line Figure 10.5] right). This graph has n = Gy nodes. 
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Example|10.16|yields a faithful orthogonal representation in dimension n—k = ey 


the number of dark triangles. This is much better than the upper bound from 
Proposition [10.19] which is n—2. 

This dimension is in fact the smallest possible; this follows by Proposition[10.18] 
since it is easy to see that the set of nodes above the bottom line is almost stable, 
and hence the minimum dimension of a faithful representation is n—k. 

We note that Aj, is planar, so all of its minors are at most 5-connected. So we 
could not hope to salvage the upper bound provided by Proposition{10.17|by looking 
at highly connected subgraphs or minors. The significance of this observation will 
be made more clear in the next section. 4 


The dark .1dve form a family of cliques covering all edges, so the construction in 


10.5. Transversality 


We have seen various nondegeneracy conditions (general position, faithfulness), 
and how they turn a complicated and probably untractable question like the exis- 
tence of an orthogonal representation in a given dimension into a question where 
some sort of answer can be obtained by combinatorial means. In this section we in- 
troduce a more complicated, deeper nondegeneracy condition, which was first used 
in this spirit by Colin de Verdiére. We spend a bit more time on it than what we 
would need in this chapter, as a preparation for its use later on. 

Consider a system of polynomial equations with real coefficients 


(10.7) pi(az) =0,...,pm(x) =0 (a# ER”). 


Each of these equations defines a hypersurface in R”. We say that a solution u 
is transversal, if these surfaces intersect transversally at this point. This means 
that their normal vectors at this point, the vectors grad p;(u),...,gradpm(u), are 
linearly independent. 

We could also rephrase this property as follows: for a linear combination p = 
5, api of the left sides of (10.7), the point u is a stationary point of p (i.e., the 
gradient of p at u is nonzero) only if the linear combination is trivial. 

The most important consequence of this property is the following: if a solution 
y of is transversal, then for every ¢ > 0 there is a 6 > 0 such that changing 
each coefficient of each polynomial by at most 6, the modified system 


(10.8) Pi(x) = 0,..-,fm(x) = 0 


has a solution z such that |z—y| < ¢. Furthermore, the vector z, as a solution 
of the system (10.8), is transversal as well. This follows by the Implicit Function 
Theorem. 

Informally, this means that the solvability of the system of equations 
is not just a numerical coincidence, but it is a “robust” consequence of the struc- 
ture of the equations. We will see that this fact leads to important combinatorial 
applications of this nondegeneracy notion. 


10.5.1. Transversal orthogonal representations. We can view an orthog- 
onal representation i +> u,; € R@ of a graph G as a system of dn variables, entries 
of a matrix U € R?*”,, whose columns satisfy the quadratic equations 


(10.9) uju;=0 (ij € E). 
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We say that the representation of G is transversal, if this solution of (10.9) is 
transversal. Computing the gradient, this says that there are no real numbers X;; 
(ij € E) such that 


(10.10) S- X;juj =0 for every i € V. 

jEN(i) 
Using terminology to be introduced in Chapter[14] the orthogonal representation u 
of G is transversal if and only if (G,u) carries no nonzero homogeneous stress. 

It will be convenient to think of X as a nonzero symmetric V x V matrix such 
that X;; =0 if? = 9 or 7j € EB. In other words, X is a nonzero G-matrix with zero 
diagonal. Note that X has a zero-nonzero pattern complementary to A = Gram(u), 
and condition is equivalent to the matrix equation X A = 0. 

For a row of X that is not all 0, equation means a linear dependence 
between the nonneighbors of i, and if X # O then at least one of these linear 
dependencies is nontrivial. It follows that every orthogonal representation of a 
graph G in locally general position is transversal. This shows that transversality 
can be thought of as some sort of weakened, symmetrized version of locally general 
position. But the two conditions are not equivalent, as the following example shows. 


Cc 


FIGURE 10.6. The graph As with an orthogonal representation 
that is transversal but not in locally general position. 


Example 10.23 (Triangular grid II). Consider the graph A3 obtained by at- 
taching a triangle to each edge of a triangle (Figure (10.6). We have constructed 
an orthogonal representation of this graph in R® (Example 10.22), which is shown 
in the figure. This representation is not in locally general position, since the nodes 
nonadjacent to (say) node a are represented by linearly dependent vectors. But 
it is transversal. Indeed, suppose that a nonzero G-matrix X with zero diagonal 
satisfies (10.10). Since node d is adjacent to all the other nodes except node c, we 
have Xq,; = 0 for 7 A c, and therefore implies that Xq,- = 0. By symmetry, 
Xoq = 0, and hence implies that X.,; = 0 for all 7. Going on similarly, we 
get that all entries of X must be zero. Note that the symmetry of X was important 
in this argument. 4 


10.5.2. Strong Arnold Property. We can forget about the meaning of A 
(in particular, about its positive semidefiniteness) and define the Strong Arnold 
Property of any G-matrix A: this means that XA 4 0 for every nonzero G-matrix 
X with zero diagonal. 

For a general G-matrix A, this definition of the Strong Arnold Property does 
not correspond to a transversality property of a system like (10.9). Nevertheless, we 
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can describe it as transversality with respect to another, more complicated system of 
algebraic manifolds. First, we have to extend the definition of transversality to more 
general manifolds. We say that smooth manifolds (not necessarily hypersurfaces) 
in R” intersect transversally at a point x, if their normal subspaces at this point 
are linearly independent. For m = 2, this means that their normal subspaces at x 
intersect at the zero subspace. 

Consider the manifold R, of all symmetric V x V matrices with rank k. It is 
not hard to see that the dimension of Ry is kn — (2). We need the following lemma 


2 
describing the tangent space of Rz and its orthogonal complement. 


Lemma 10.24. (a) The tangent space Ta of Ry at A € R* consists of all matrices 
of the form AU+U'A, where U ranges through all V x V matrices. 

(b) The normal space N4 of Rx at A consists of all symmetric V x V matrices 
X such that AX = 0. 

Note that U in (a) is not necessarily symmetric, but AU+U™A is. 

Proof. (a) Consider any matrix of the form AU+U™A, and the family 

Ay = (I+1U)"A(T +20). 

Clearly rk(A;) < rk(A) = & and equality holds if |¢| is small enough. Furthermore, 
(£At)e=0 = AU+U'A, showing that AU+U"A € Ty. 

Conversely, let Y € 74. Then there is a one-parameter differentiable family 
A; of symmetric matrices, defined in a neighborhood of t = 0, so that Ay € Re, 
Ag = A and Ap = (£At)t=0 = Y. In a sufficiently small neighborhood of t = 0 
all matrices A; have the same signature of eigenvalues, which implies that they 
can be written in the form A; = ue DU;, where D is a diagonal matrix in which 
the first k, diagonal entries are 1, the next k—k, diagonal entries are —1, and the 
rest are zero, and U; is a nonsingular matrix which depends on ¢ in a smooth way. 
Differentiating, 

Y = Ap = Ud DUp + U4 DU = (Up Uo) "A+ A(Up Uo). 

This shows that Y has the form as stated. 

(b) We have X € N(A) if and only if X-Y = 0 for every Y € Ty. By (a), this 
means that X-(AU+U"'A) = 0 for every U € R"*". This can be written as 

0 = Tr(X(AU+U'A)) = Tr(X AU) +Tr(XU'A) = Tr(X AU) + Tr(AUX) 

= 2Tr(X AU). 

This holds for every U if and only if XA = 0. 


Using this description, we can easily prove the following lemma giving a geo- 
metric meaning to the Strong Arnold Property in the case of general G-matrices. 
Recall that Mg denotes the linear space of G-matrices. 


Lemma 10.25. For a graph G, a G-matrix A has the Strong Arnold Property if 
and only if the manifolds Ry, (k = rk(A)) and Mg intersect transversally at A. 


Proof. Let X be a nonzero matrix in the normal spaces of Mg and Ry, at A. 
The first condition says that X;; = 0 for every ij € HUA; the second says that 
XA = 0 (by Lemma[l0.24). These are just the conditions on X needed to disprove 
the Strong Arnold property. 
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— Aj=0 VijeE 


— rk(A) = rk(M) 


FicureE 10.7. The Strong Arnold Property 


We note that every matrix A with corank at most 1 has the Strong Arnold 
Property automatically. Indeed, if AX = 0, then X has rank at most 1, but since 
X is symmetric with 0’s in the diagonal, this implies that X = 0. 

It is not hard to prove that whether or not a G-matrix A has the Strong Arnold 
Property depends only on the graph G and on the nullspace of A (see Exercise 
(10.10). A useful combinatorial property of the nullspace is the following. 


Lemma 10.26. Let G be a graph, let A be a G-matrix with the Strong Arnold 
Property. Let x,y € Ker(A). Then either supp(x) and supp(y) intersect, or there 
is an edge of G connecting them. 


As a special case, we see that G has at most one connected component such 
that the corresponding diagonal block of A is singular. 


Proof. Suppose not, then X = ry'+yzx' is a nonzero G-matrix with zero 
diagonal satisfying MX = 0, contradicting the Strong Arnold Property. 


Coming back to orthogonal representations, we have seen that transversality is 
weaker than locally general position; but a condition in the spirit of locally general 
position does follow from the previous lemma. 


Corollary 10.27. Let G be a simple graph and let S,T C V be disjoint subsets such 
that no edge of G connects them. Let u be a transversal orthogonal representation 
of G. Then either u(S) or u(T) is linearly independent. 


As a special case, two nonadjacent nodes cannot be both represented by the 
zero vector. 


10.5.3. Algebraic width. Based on transversality, an interesting graph in- 
variant related to connectivity was introduced in (this is 
different from the better known Colin de Verdiére number to be discussed in Chap- 
ter 16). He formulated this notion in terms of matrices, but from our perspective, 
working with orthogonal representations is more natural. 

It will be useful to introduce an extension of the notion of faithful represen- 
tations, based on [van der Holst 1996]. We would like to “interpolate” between 
faithful and nonfaithful orthogonal representations, by allowing “weak” edges for 
which the inner product may be zero or nonzero. Van der Holst suggests to encode 
this requirement by considering these edges as edges with multiplicity larger than 
one. This seems to be somewhat arbitrary, but it will turn out to be quite useful. 
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Given a multigraph G, let Ag = (a;;) be its adjacency matrix, where a;; denotes 
the number of edges connecting nodes 7 and j. We define a faithful orthogonal 
representation of it as a vector labeling u: V > R¢, such that 


(10.11) ie ee eS 
#8 x 0, if aig = ‘Its 


There is no condition when i = j or when 7 and j are connected by more than one 
edge. 

If the graph is simple, this gives the previously studied notion of faithfulness. 
If all edges have multiplicity larger than one, then we get all orthogonal repre- 
sentations. (The value of the multiplicity does not matter, once it is larger than 
one.) For every graph G, we define G= as the graph obtained from G by doubling 
all edges. Then faithful orthogonal representations of G~ are just all orthogonal 
representations of G. 

Let d be the smallest dimension in which a multigraph G has a faithful transver- 
sal orthogonal representation, and define Waig(G) = n—d. We call Waig(G) the 
algebraic width of the graph (the name refers to its connection with tree width, see 
below). This definition is meaningful, since it is easy to construct a faithful orthog- 
onal representation in R”, in which the representing vectors are almost orthogonal 
and hence linearly independent, which implies transversality. The algebraic width 
measures how much we can save in the dimension of a faithful transversal orthogonal 
representation, relative to the trivial dimension n. 

This definition can be rephrased in terms of the Gram matrix of the represen- 
tation. For any faithful transversal orthogonal representation (u; :4e V), the 
matrix N = Gram(u) has the following properties: 


=0 iftAj and a; =0, 
£0 ifay=1; 

(N2) N is positive semidefinite; 

(N3) If X is a nonzero G-matrix with zero diagonal, then NX 4 0. 


Conversely, every matrix N with these properties can be written as the Gram 
matrix of a vector labeling (by (N2)), and it is easy to check that this vector labeling 
is a faithful transversal orthogonal representation. The rank of the matrix is the 
dimension of the representation. Hence we get the following reformulation: 


(N1) Ni; 


Lemma 10.28. The algebraic width of a graph G is equal to the maximum corank 
of a matrix with properties (N1)—(N3). 


Example 10.29 (Complete graphs). The Strong Arnold Property is void for 
complete graphs, and every representation is orthogonal, so we can use the same 
vector to represent every node. This shows that Waig(K,) = n—1. For every 
noncomplete graph G, Waig(G) < n—2, since a faithful orthogonal representation 
requires at least two dimensions. 

Doubling edges makes a difference: to get a faithful orthogonal representation 
of K;, we can represent all nodes by the null vector. The Strong Arnold Property 
is void again, which shows that waig(K;,) =n. 4 
Example 10.30 (Edgeless graphs). To have a faithful orthogonal representation, 
all representing vectors must be mutually orthogonal, hence Wajg(K,) = 0. It is 
not hard to see that every other multigraph G has Waig(G) > 1. 4 
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Example 10.31 (Paths II). Let P,, denote the path with n nodes, and assume 
that n > 2. Every matrix N satisfying (N1) has an (n—1)x(n—1) nonsingular 
(triangular) submatrix, and hence by Lemma[10.28} Walg(Pn) <1. Since P,, has an 
edge, we know that equality holds here. 4 


Example 10.32 (Triangular grid II). To see a more interesting example, let us 
have a new look at the graphs A, in Figure [10.5] (Example 10.22). For k = 3, we 
have seen that A3 has a faithful transversal orthogonal representation in R*, and 
this is clearly best possible, so Waig(A3) = 3. 

For general k, the construction in Example yields a faithful orthogonal 
representation u. Recall that each coordinate of u; corresponds to one of the dark 
triangles B, and it is 1 if 2 € B, and 0 otherwise. We show that this representation 
is transversal. Let X be a nonzero G-matrix with zero diagonal disproving the 
transversality of u. This means that 5> j Xjj;u; = 0 for every node 7, which implies 
that 


(10.12) S> Xz =0 
jeB 
for every dark triangle B and node i. 

We prove that X;; = 0 for all i,j € V = V(Ax) by induction on the distance 
of ¢ and j (in the sense of 2-dimensional Euclidean distance in the figure). If i and 
j are neighbors, then this is clear, since X is supported on E = E(A,). Suppose 
that ij € FE, then the interior segment connecting i and j intersects a dark triangle 
incident with either i or 7; say it intersects triangle B = jkl. Then both k and / are 
closer to i than 7 is, and hence we already know that X;, = Xj = 0. By (10.12), 
this implies that X,; = 0. 

So u is a faithful transversal orthogonal representation of A;. The dimension 
of this representation is the number of dark triangles, which is clearly Co. We 
have argued that this is the smallest possible dimension (this follows just from 
faithfulness, not using transversality). Thus 


iat eae ap _ (‘)- ("") he 
+ 


We continue with some easy bounds on the algebraic width. The condition that 
the representation must be faithful implies that the vectors representing a largest 
stable set of nodes must be mutually orthogonal nonzero vectors, and hence the 
dimension of the representation is at least a(G). This implies that 


(10.13) Walg(G) < n—a(G) = 7(G). 

By Theorem |10.10} every k-connected graph G has a faithful general position or- 
thogonal representation in R"~", and hence 

(10.14) Walg(G) > K(G). 

We may also use the Strong Arnold Property to bound the algebraic width. There 
are (5) —m orthogonality conditions, and in an optimal representation they involve 
(n—Waig(G))n variables. If their normal vectors are linearly independent, then 


(5) —m < (n—Waig(G))n, and hence 


(10.15) Walg(G) <—— + 


n+1loem™ 
=~ 2 n- 
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The most important consequence of the Strong Arnold Property is the follow- 
ing. 


Theorem 10.33. The algebraic width Waig(G) is minor-monotone: If H is a minor 
of G, then Walg(H) < Waig(G). 


Proof. It suffices to consider three cases: when H arises from G by deleting an 
isolated node, by deleting an edge, or by contracting an edge. We consider a faithful 
transversal orthogonal representation u of H in dimension d = |V(H)|—Waig(#), 
and modify it to get a faithful transversal orthogonal representation of G. 


The case when H arises by deleting an isolated node 7 is trivial: Starting with 
an optimal representation of H, and representing i by the (d+1)-st unit vector, 
we get a representation of G in dimension d+1. It is straightforward to see that 
this faithful representation of G is transversal. Hence Waig(G) > n—(d+1) = 
|V(H)|—d = Waig(H). 

Suppose that H = G\ab, where ab € E. We start with an optimal representa- 
tion i+ u,; of H, this satisfies the conditions 


(10.16) ujuj=0 (ij € E(H)). 

By transversality, for a sufficiently small ¢ > 0, the system of equations 
(10.17) xix;=0 (ij € E(H)\ab) 

(10.18) x!x, =e 


has a solution in x; € R?, i € V such that |u;—x,| < e for all i € V, and which is 
transversal with respect to the system (10.17)-(10.18). If € is small enough, then 
this representation is a faithful orthogonal representation of G, and it is transversal 
with respect to (10.17), showing that Waig(G) > n—d = Walg(H). 

The most complicated case is obtained when H arises from G by contracting 
an edge ab to a node. It will be convenient to assume that N(a)MN N(b) = @ (we 
can delete the edges connecting b to N(a); this does not change the contracted 
graph, and does not increase Waig(G), as we have just proved). Let A = N(a)\b, 
B=N(b)\aand R=V(H)\A\B. We identify b with the node of H obtained by 
the contraction, and add a new isolated node a to H, to obtain a graph G’. We 
have V(G’) = V, and G’ is obtained from G by shifting all ends of edges from a to 


b (Figure [10.8). 


FIGURE 10.8. Minor monotonicity of the algebraic width 
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We start again with a faithful transversal orthogonal representation (u; 2 4e 
V(H)) in dimension d = |V(H)|—Waig(H) of H, and want to construct a faith- 
ful transversal orthogonal representation of G in dimension d+1. We start with 
constructing a vector labeling of G’ in R¢*!: 


We have seen above that this is a faithful transversal orthogonal representation 
of G’. By the Implicit Function Theorem, for a sufficiently small ¢ > 0 there is 
a vector labeling (z; : i € V) such that z; is arbitrarily close to v;, and z is a 
transversal solution of the system of equations 

(10.19) zi2;=0 (ij € E(H) ori=a,j€ BUR), 


(10.20) Z4Zj — E22; = 0 (j € A). 


Define another vector labeling of V by 


Za—€Z, ifi=Od, 
wi = . 
Zi, otherwise. 


We claim that (w;: i € V) is an orthogonal representation of G. Let ij € E(G). 
If i,j # 6, then w/w; = 0 follows immediately from (10.19). If (say) i = b and 
j © R, then 

WiW; = (Za—€%p) 'Z; = 212; —€2,Z; = 0 


by (10.19). Finally, if 7 = b and 7 € A, then 


w,W; = (Za—€z)'z; =0 


by (100). 


Second, we show that w is faithful, if ¢ is small enough, i.e., w)w,; 4 0 for all 


ij € E. If i,j 4,6, then this is clear, since w/w; ~ v/v; = uju; 4 0 (the error 
at the © sign can be made arbitrarily small). If 77 = ab, then wi w; = wlw, = 


Z Za —€2%1 2, # 0 if € is small enough. If i =a and j € A, then w/w; = ez] zp 4 0 


(since ic € E(H)). If i = b and j € B, then wiw,; = (Za—e%-)'2; = ezla; & 
eviv; =culu,; 40. 

Finally, we show that (w; : 7 € V) is transversal (as an orthogonal represen- 
tation of G), if ¢ is small enough. Suppose that there exists a nontrivial linear 
combination 


f(x) = D0 Xgxlx; 
ijeE 
of the orthogonality conditions such that w is a stationary point of the function f. 
The substitution 


Ya—€Yb; if 7 = 6, 
xy ‘ 
Yi otherwise 
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maps w onto z, and it turns f into the function 


gy) = >> Xuylyst+ SY) Xvjlva—eyo)"y; 
ij€E jE AUR 
i,g4b 
= So Xyylyjs+ Do Xoiyays—€ D> Xoiydys + D5 Xj (ya — evs) "ys. 
sa jEeR jeER jEA 
tJ 


The function g is a linear combination of the left hand sides of the equations (10.19)- 
(10.20), and z is a stationary point of this linear combination. Since z is transversal, 
this linear combination must be trivial. The last sum is the only one where the 
functions (ya—ey»)"y; (j € A) are used, and hence X,; = 0 for j € A. Similarly, 
the third sum gives that X,; = 0 for j ¢ R. This means that all coefficients in the 
second sum are 0. This leaves us with the first sum, so all coefficients there must 
be 0 as well. But this is a contradiction, since f(x) was assumed to be a nontrivial 
linear combination. 


10.5.4. Algebraic width, treewidth and connectivity. We start with sur- 
veying several minor-monotone graph parameters related to connectivity. 

The connectivity of a minor of a graph can be larger than the connectivity of 
the graph itself. (A trivial example is a subdivision of a highly connected graph, 
which is only 2-connected, but by edge contraction we get the original graph as 
a minor.) Thus it makes sense to define the monotone connectivity Kmon(G) of a 
graph G as the maximum connectivity of any minor of G. 

Tree-width was introduced in as an important ele- 
ment in their graph minor theory. We try to decompose our graph G as the union 
of subgraphs G;, which are indexed by the nodes of a tree T, with the following 
property: if 7,7,& are three nodes of T and 7 lies on the path between 7 and k, 
then V(G;)NV(G) C V(G;). Such a family is called a tree-decomposition of the 
graph G. The tree-width of a graph G is the smallest integer k for which G has a 
tree-decomposition into parts with at most k+1 nodes. It is not hard to see that 
&(G) < wer(G), which implies 


(10.21) Kmon(G) < Wer(G). 


A closely related parameter, which we call the product-width wp,(G) of a graph 
G, was also defined in [Colin de Verdiére 1998a]. This is the smallest positive in- 
teger r for which G is a minor of a Cartesian product K;UOT’, where T is a tree. 
The reason why we consider the product width a “connectivity-related” parameter 
is that it is almost equal to treewidth: 


(10.22) Wer(G) < Wor(G) < wir(G) +1. 


The lower bound was proved in [van der Holst 1996], the upper, in 
[Colin de Verdiére 1998al. The proof of these inequalities is left to the reader as a 
(not quite easy) exercise. 

Monotone connectivity, product width and treewidth are minor-monotone 
graph parameters; this is trivial for the first two, and well known for treewidth. 
It is easy to verify that a graph is a forest if and only if either one of these param- 
eters is at most 1. 

So far, we have discussed connectivity-type graph parameters; now we relate 
them to the algebraic width, which will be sandwiched between two of them: 


170 10. ORTHOGONAL REPRESENTATIONS: DIMENSION 


Theorem 10.34. For every graph G, 
Kmon(G) < Waig(G) < Wpr(G). 


The upper bound was proved in {Colin de Verdiére 1998a], while the lower 
bound is an easy consequence of the results in Section 
Proof. Let H be a minor of G with largest connectivity. Using (10.14) and 
the fact that Waig(G) is minor-monotone, we get the first inequality: 
Kmon(G) = K(H) < Walg (7) < Walg(G). 


To prove the second, we use again the minor-monotonicity of Waig(G): it suffices to 
prove that 


(10.23) Walg( Ky T) < r 


for any r > 1 and tree T. Note that if T has at least two nodes, then K;4, can 
be obtained as a minor of K,OT, and hence Waig(K,OT) > Waig(K,+41) = 7 by the 
minor monotonicity of algebraic width. So for every tree T with at least one edge, 


inequality (10.23) holds with equality. 
Inequality (10.23) says that the minimum dimension in which K,UT has a faith- 


ful transversal orthogonal representation is at least (n—1)r, where n = |V(T)|. In 
the argument below, we only need the faithfulness of the representation, transversal- 
ity is not used in this part. Using Proposition[10.18| it suffices to show that K,OT 
has an almost stable set of size (1—1)r. We claim that for any node u € V(T), the 
subset S = (V(T)\u) x V(K,) is almost stable. 

Let H be a connected subgraph of (K,OT)[S]. Considering u as the root of T, 
let uv be a node of T closest to wu such that H contains a node from {v} x V(K;,). 
Note that all nodes of H are contained in sets {w} x V(K,) where w =v or w isa 
descendant of v. Let (v,i) be a node of H, and let v’ be the parent of v in T. Then 
(v’,7) is a node of KOT adjacent to (v,i), but to no other node of H. This proves 
that S is almost stable, and thus completes the proof of the theorem. 


For small values, equality holds in Theorem |10.34{ this was proved in 
van der Holst 1996) and in |Kotlov 2000}. 


Proposition 10.35. Let G be a graph with at least one edge. If Waig(G) < 2, then 
Kmon(G) = Walg(G) = Wpr(G). 


Proof. Using Theorem |10.34] (an easy special case) and the remark above 
about forests, we get 


Wale(G) =1 => Kmon(G)=1 => G is a forest => wpr(G) =1 => waig(G) =1, 
and this proves the assertion for Waig(G) = 1. Suppose that waig(G) = 2, then 
Kmon(G) < 2 by Theorem Furthermore, neither K4 nor the graph Ag in 
Example [10.23] is a minor of G, since Walg(H4) = Waig(As) = 3. 

By Exercise [10.13] G satisfies wp,(G) < 2. We must have equality, since other- 
wise Waig(G) = 1 would follow. 


A planar graph has kmon(G) < 5 (since every simple minor of it is planar and 
hence it has a node with degree at most 5). Since planar graphs can have arbitrarily 
large algebraic width (see Exercise[10.15), the first inequality in Theorem|[10.34]can 
be arbitrarily far from equality: no function of Kyo, bounds Wai, from above. 
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The following example from {van der Holst_1996] shows that in general, equality 
does not hold in the upper bound in Theorem }10.34| either. 


Example 10.36 (Wagner graph). Consider the Wagner graph Wg obtained from 
the cycle Cs by adding its longest diagonals. Then kKmon(Ws) = Waig(Ws) = 3 and 
Wir(Wg) = Wpr(Ws) = 4. These facts take some work to verify, especially the value 
of the algebraic width. 4 


As a weak converse of the inequality Waig < Wpr in Theorem|10.34| the product 
width (or tree-width) is bounded by some function of the algebraic width: For every 
graph G, we have 


(10.24) Wpr(G) = O(Waig(G)””). 


indeed, Example|10.32]implies that G cannot contain the triangular grid Ay,..(@)+1 
as a minor. A basic result of graph minor theory |Robertson—Seymour 1986] says 
that the treewidth of such a graph is bounded by an appropriate function of the 


grid size. The bound used in (10.24) was proved in |Chuzoy 2016}. 


10.6. The variety of orthogonal representations 


In this section we take a global view of all orthogonal representations of a graph 
in R¢. For small values of d, several of our considerations might be easier to follow 
in terms of the complementary graph G. In particular, G is (n—d)-connected if 
and only if G does not contain a complete bipartite graph on d+1 nodes (i.e., a 
complete bipartite graph K,» with a,b >1,a+b=d+1). 

Let G be a simple graph on n nodes, which are labeled 1,...,n. Every vector 
labeling i + v; € R¢@ can be considered as a point in R®, and orthogonal rep- 
resentations of G in R@ form an algebraic variety ORa(G) C R%, defined by the 
equations 


(10.25) ujuj=0 (ij € E). 


We could consider orthonormal representations, whose variety is defined by the 
equations 


(10.26) uuj=l (EV), uju;,=0 (je £). 


The set of orthogonal representations of G that are not in general position also 
forms an algebraic variety: we only have to add to (10.25) the equation 


(10.27) [[ — det(ui,-.-,ui,) = 0. 
1<i<-<ia<n 

Similarly, the set of orthogonal representations that are not in locally general posi- 
tion form a real algebraic variety. The set GOR (G) of general position orthogonal 
representations of G in R¢, as well as the set COR,(G) of locally general position 
orthogonal representations of G, are semialgebraic. 

Let us a look at some examples showing the complications (or interesting fea- 
tures, depending on your point of view) of the variety of orthogonal representations 
of a graph. 


Example 10.37. Consider the graph K consisting of two isolated nodes. Every 
orthogonal representation of this graph consists of two orthogonal vectors. It follows 
that in dimension d = 1, one of the labels must be 0; the other one is arbitrary. So 
the variety of orthogonal representations decomposes into two subvarieties, which 
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intersect at the point (0,0). This graph has no orthonormal representation in 
dimension 1. 

For d = 2, we look at pairs (uj, uz) of orthogonal vectors. In the orthonormal 
case, this implies that ug arises from uy; by rotation of 90° either in the positive or in 
the negative direction. In other words, either (ui); = (ug)2 and (u1)2 = —(ug)1, or 
(ui)1 = —(ug)2 and (uj1)2 = (ug)1. So the variety of orthonormal representations 
decomposes into two algebraic varieties. It is not hard to see that these two are 
irreducible. 

Considering, more generally, orthogonal representations in the plane, their va- 
riety decomposes into two semialgebraic sets in a natural way, depending on the 
sign of det(u,, uz). These two semialgebraic sets intersect in the set of degenerate 
labelings {(u,,u2) : uy, = 0 or ug = O}. These two semialgebraic sets are not 
algebraic however. 

For d > 2, both varieties (orthogonal and orthonormal representations) are in- 
decomposable. Orthonormal representations of K,, in R¢ form the Stiefel manifold 
of n-frames in R¢. 4 


Example 10.38. Let us look at the structure of orthogonal representations of an 
arbitrary graph G in one dimension. We have a single variable x; assigned to each 
node i, and the orthogonality conditions say that x;2; = 0 for ij € E. This is 
trivially equivalent to saying that the support of the vector « = (a; : 1 € V) is 
a clique. So this variety decomposes into as many subvarieties as the number of 
maximal cliques. 

This shows that the variety of orthogonal representations can have a very com- 
plex structure even in the smallest dimension. For example, the dimension of this 
variety is w(G), an NP-hard quantity. 4 


Example 10.39. Our last example is nontrivial, and it will play an important role 
later in this section. Let Q® denote the graph of the (ordinary) 3-dimensional cube. 
Note that Q® is bipartite; let A = {a1,a2,a3,a4} and B = {b1,b2,b3,b4} be its 
color classes. The indices are chosen so that a; is opposite to 6; (Figure [10.9] (a)). 

Since Q* contains neither Ky,4 nor K23, the graph Q? is 4-connected, and 
hence it has a general position orthogonal representation in R*. This is in fact easy 
to construct. Choose any four linearly independent vectors x,,...,x4 to represent 
a1,...,@4, then the vector y; representing b; is uniquely determined (up to scaling) 
by the condition that it has to be orthogonal to the three vectors x;, 7 # i. It 
follows from Theorem [10.10] that for almost all choices of x,,...,x4, the orthogonal 
representation obtained this way is in general position. 

For bipartite graphs G, we can think of orthogonal representations of G in R@ 
in another way. Vectors representing one bipartition class can be thought of as 
homogeneous coordinates of points in projective (d—1)-space, while vectors in the 
other class can be thought of as homogeneous coordinates of planes. Orthogonal- 
ity is then translated to incidence. So to construct a general position orthogonal 
representation of Q3 in R*, we need four points in general position in 3-space, and 
four planes, each going through exactly three of these points. In other words, we 
get the vertices and faces of a tetrahedron (Figure [10.9] (b)). 

We show that every orthogonal representation of this graph in general position 
satisfies a certain algebraic equation. Let x, denote the orthogonal projection of 
x; onto the first two coordinates, and let y; denote the orthogonal projection of y; 
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FIGURE 10.9. (a) The cube as bipartite graph. (b) An orthogonal 
representation in general position, depicted as a tetrahedron in 
projective 3-space. The homogeneous coordinates of the vertices 
represent one color class, those of the facets represent the other. 
(c) A degenerate orthogonal representation. 


onto the third and fourth coordinates. We claim that 


det (x, x5) det(x4, x/,) _ det(y},y3) det(y$, y4) 
det (x, x’/,) det(x,x5)  det(y{, y4) det(y, y3) 


(10.28) 


Before proving (10.28), we remark that in a geometric language, this equation 
says the following: Let @ be a line in projective 3-space that intersects the planes of 
the faces of a tetrahedron T at four distinct points x, X2,X3,X4. Let 41, Ne, 43, U4 
be the four planes spanned by € and the vertices of T, indexed so that the face 
defining x; is opposite to the vertex defining 4;. Then the cross ratios (x, : X29: 
x3 : x4) and (Oy : Hg: U3: U4) are equal (see Appendix [A.4). 

To prove (10.28), we may assume that x}y; = 1 for i = 1,...,4. This can 
be done, since x;y; # 0 (else, x1,...,x4 would all be contained in the 3-space 
orthogonal to y;, contradicting the assumption that the representation is in general 
position), and the equation is invariant under scaling. This means that the 4x 4 
matrices X = (x),X2,x3,x4) and Y = (yj, y2,y3, y4) satisfy X'Y = J. By the 
well-known relationship between the subdeterminants of inverse matrices, we have 


det (x}, x3) = det(X) det(y},y4), 
and similarly for the other determinants in (10.28). Substituting these values, we 


get after simplification. 

There is another type of orthogonal representation of this graph, in which the 
elements of U are represented by vectors x ,,...,X4 in a linear subspace L of R+ 
and the elements of W are represented by vectors yi,...,y4 in the orthogonal 
complement L+ of L. Clearly, this is an orthogonal representation for any choice of 
the representing vectors in these subspaces. This representation is never faithful. 
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As a special case, we can choose 4 vectors x; in the plane spanned by the first 
two coordinates, and 4 vectors y; in the plane spanned by the third and fourth 
coordinates. Since these choices are independent, equation [10.28] will not hold in 
general. (In the projective representation, we get four collinear points and four 
planes incident with their line; Figure[10.9| (c)). 

It follows from these arguments that the equation 


det (x1, X2, X3, X4) ( det(x',, x3) det(x}, x4) det(y), y4) det(y5, y3) 
— det (x1 ,x4) det(x2, x4) det(y2,y4) det(y1,y4)) = 0 


is satisfied by every orthogonal representation, but the first factor is nonzero for 
the orthogonal representations in general position, while the second is nonzero for 
almost all of those of the second type. So the variety ORg(Q?) is reducible. 4 


10.6.1. G-orthogonalization revisited. For any graph G with ordered 
nodes, and any vector labeling x € R¢*”, the G-orthogonalization procedure de- 
scribed in Section assigns an orthogonal representation ®x ¢ R¢*”. The 
mapping ®: R¢*" —> R¢X” is well-defined, and it is a retraction: if x itself is 
an orthogonal representation, then ®x = x. In particular, the range of ® is just 
ORa(G). But the map ® is not necessarily continuous (check Example[10.37). 

We can modify this procedure so that we get vectors whose coordinates are 
polynomials of the original coordinates. Given x € R?*", we define the vectors 
Zi € R? recursively by 


x; Zi, Zi, 
(10.29) 24 = 7s ie cade 
2} x; 2} i, adele Zi Zi, 
where N(j)M[j] = {t1,---,7-}. (In this compact notation, the first row consists 


of vectors, the later rows consist of numbers. Expanding the determinant gives a 
meaningful expression.) This defines a map UV: xz. 
Obviously, z; is a linear combination 


e 
(10.30) Bj = jXjy+) | BiaBir, 

t=1 
and it is orthogonal to z;,,...,Z;i,, since taking inner product with any of these 


vectors, we get a determinant with two equal rows. It follows that z is an orthogonal 
representation of G for every x € R?*". It also follows by induction on j that each 
entry of each z; is a polynomial in the coordinates of x. 

We want to compare the vectors z with the vectors y = ®x. The following 
lemma sums up some simple properties of this modified G-orthogonalization pro- 
cedure. 


Lemma 10.40. With the notation above, z; = 0 if and only if x;,2:,,..-,2i, are 
linearly dependent. The vector z; is a nonnegative scalar multiple of y;. 


Proof. If x;,z;,,...,2;, are linearly dependent (in particular, if either one of 
them is zero), then z; = 0, since the columns of the determinant (10.29) are linearly 
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dependent. If x;,2i,,...,2;, are linearly independent, then the linear combination 
as in (10.30) is unique, and 
c. T 
ZjZiy +++ 2j,Bi, 
aj=| : : > 0, 
T T 
Zj Zi, Z; Zi, 


so z; #0. 
For the second assertion, we prove by induction on j that if z; # 0, then 
Z; = ajy;. We know by the first assertion of the lemma that z;, 4 0 for 1 <t<r, 


and hence z;, = ai, yi, by induction. It follows that z; is orthogonal to yi,,...,yi,, 
and 
1 “ B54 
—g,=x,;+ i EL Sy 
a; ? i do a; Viz 


Since these properties determine y,;, we get that (1/a,;)z; = y;. 


Suppose that we scale one of the vectors x; by a scalar A; # 0. From the 
orthogonalization procedure we see that y; does not change for i # j, and y; is 
scaled by A;. Trivially, the vectors z; do not change for i < j, and z; will be scaled 
by A;. The vectors z; with i > 7 change by appropriate factors that are powers of 
Aj. It follows that if we want to scale each z; by some given scalar A; 4 0, then 
we can scale x,,...,X, one by one to achieve that. As a consequence, we see that 
the range Rng(W) is closed under rescaling each vector independently by a nonzero 
scalar. 

Every orthogonal representation is a fixed point of ®, and so Rng(®) = OR(G). 
The range of UW depends on the ordering of the nodes: an orthogonal representation 
is in the range of W if and only if for every node with z; ¢ 0 the vectors representing 
the nodes in N(j)M{1,...,j7—1} are linearly independent. We have 


(10.31) Rng() C Rng(®) = OR(G), 


and Example shows that equality may not hold even for (n—d)-connected 
graphs. 

Every orthogonal representation in locally general position is in the range of 
W, so we have GOR(G) C LOR(G) C Rng(W). This is uninteresting if G is not 
(n—d)-connected (in this case GOR(G) = LOR(G) = 0). Let us assume that G is 
(n—d)-connected, then ®x is in general position for almost all x by Lemma|[10.12] 
and hence so is Wx. Since W is continuous, it follows that GOR(G) is dense in the 
range of W for every ordering of the nodes, and so for every (n— d)-connected graph 
G, 


(10.32) GOR(G) = LOR(G) = Rng(¥). 


10.6.2. Irreducibility. One may wonder whether various algebraic proper- 
ties of the varieties ORa(G), GORa(G) etc. have any combinatorial significance. 
The dimension of this variety may be interesting, but the fact that this is an NP- 
hard quantity even in the simplest case when d = 1 (Example should warn 
as that this dimension my be intractable. In this section we address only one 
question, namely the irreducibility of ORa(G), which does have some interesting 
combinatorial aspects. 

A set A C RN is reducible if there exist two polynomials p,q € R[x1,..., xy] 
such that their product pq vanishes on A, but neither p nor q vanishes on A; 
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equivalently, the polynomial ideal {p: p vanishes on A} is not a prime ideal. If an 
algebraic variety A is reducible, then it is the union of two algebraic varieties that 
are both proper subvarieties, and therefore they can be considered “simpler” . 


Proposition 10.41. If G is not (n—d)-connected, then the algebraic variety 
ORa(G) of its orthogonal representations is reducible. 


Proof. In this case G has no general position orthogonal representation in R@ 
by Theorem Hence equation (10.27) holds on ORg(G). On the other hand, 
none of the equations 


det (vi,,-.., Vig) = 0 (Lin <o*<%g 57) 
holds everywhere on ORg(G), because 


ex, iff =4%, k= 1,25, 
Vj = . 
0, otherwise, 


is an orthogonal representation with det(v;,,...,Vvi,) 4 0. 


In the opposite direction, we show the following. 


Proposition 10.42. For every graph G, the semialgebraic variety GORa(G) of its 
orthogonal representations in general position is irreducible. 


Proof. Recall that GOR4«(G) is nonempty if and only if G is (n—d)-connected, 
so the assertion is trivial unless G is (n—d)-connected. Let f and g be two poly- 
nomials such that fg vanishes on GOR«g(G). Then fg vanishes on the closure of 
GORa(G); by this means that fg vanishes on the range of UW. In other 
words, f(Wx)g(Wx) vanishes for all x € R¢™”. Since f(Wx) and g(Wx) are polyno- 
mials in x, it follows that either f(Wx) or g(Wx) vanishes identically; say the first 
occurs, then f vanishes on Rng(W), and hence also on GOR,g(G) by (10.32). 


From now on, we assume that G is (n—d)-connected. To exclude some trivial 
complications, we assume that d < n, so that G is connected. Since this connectivity 
condition does imply a lot about orthogonal representations of G in R¢ by Theorem 
we might expect that ORa(G) will be irreducible in this case. Example 
shows that this is false in general, so Proposition [10.41] cannot be reversed. Our 
goal is to prove that, at least for d < 4, this is essentially the only counterexample. 
The case of higher dimensions remains open. 


Theorem 10.43. Let G be an (n—d)-connected graph, where d < 4. Then 
GORa(G) is dense in ORa(G), and hence ORa(G) is irreducible, unless d = 4 
and G has a connected component that is the skeleton of the 3-cube. 


To give the proof, we need some definitions (just for this section). We say 
that an orthogonal representation of a graph G in R¢ is exceptional if it does not 
belong to the topological closure of GOR«g(G). A graph G is exceptional, if it 
is has an exceptional orthogonal representation. (If G is not (n—d)-connected, 
then every representation in ORa(G) is exceptional, but we are interested in the 
(n—d)-connected case.) By this definition, the set of exceptional orthogonal repre- 
sentations of G in R@ is relatively open in OR4(G). 

If an induced subgraph of a graph G is exceptional, then so is G itself: we can 
extend the exceptional representation of the induced subgraph to an orthogonal 
representation of the whole graph, and clearly this remains exceptional. (Note that 
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every induced subgraph G" is (|V(G’)|—d)-connected.) A graph G is minimally 
exceptional, if it is exceptional, but no proper induced subgraph of it is exceptional. 

Let x and y be orthogonal representations of G in R¢. If we keep x fixed and 
make 5°; |y;—x,| small enough, then for every S C V for which x(S) is linearly 
independent, so is y(S), while some of the linear dependencies among the x; may 
be broken. We say that the orthogonal representation x is locally freest, if for every 
orthogonal representation y for which )°, |y;—x,| is small enough, and every subset 
S CV, x(S) is linearly dependent if and only if y(S) is. Clearly every graph having 
an exceptional representation in R®@ has a locally freest exceptional representation 
arbitrarily close to it. Equation implies that if a locally freest orthogonal 
representation is in locally general position, then it is in general position. 

The following lemma summarizes some useful properties of minimally excep- 
tional graphs and their locally freest exceptional representations. When a graph G 
and an orthogonal representation x are understood, we define A, = lin(x(N(u))) 
foruée V. 


Lemma 10.44. Let G be a minimally exceptional graph, and let x be a locally 
freest exceptional orthogonal representation of it. 


(a) The complementary graph G is connected. 


(b) For everyuéV, the set x(N(u)) is linearly dependent. 

(c) Let SCV andue€V\S. If x, € lin(x(S)), then At C lin(x(S)). 

(d) Any two vectors xy, are linearly independent, and 2 < dim(A,,) < d—2 for 
every node u. In particular, no node is represented by the zero vector. 


Proof. (a) Suppose that V(G) = Vi UV, is a partition into nonempty sets so 
that every node in Vj is adjacent to every node in V2 in G. Consider any orthogonal 
representation x of G. Then the orthogonal representations of x|y, of G[V;] are 
not exceptional, and so they can be approximated by general position orthogonal 
representations y;. Then y;Uy2 is an orthogonal representation of G which is in 
locally general position. By (10.32), yiUy2 can be approximated by orthogonal 
representations in general position, so x is not exceptional. 

It follows that no node of G is isolated, and since every node of G has degree 
at least n—d, we get that n > d+2. 


(b) Call a node v € V fat if x(NV(v)) is linearly independent and slim otherwise. 
We want to prove that all nodes are slim. There must be at least one slim node 
(else, the representation is in locally general position and hence in general position, 
and not exceptional). 

Suppose that G has a slim node i and a fat node 7. Since G is connected, we 
may choose i and j so that they are adjacent. 

The representation x’ obtained by restricting x to V(G\jJ4), is an orthogonal 
representation of G’ = G\Jj, and hence there exists a y’ € GOR4«(G’) arbitrarily 
close to x’. We can extend y’ to an orthogonal representation y of G; since x’ (N(j)) 
and y’(N(j)) are linearly independent, we can choose y; so that it is arbitrarily 
close to x; if y’ is sufficiently close to x’. 

This extended y is strictly locally freer than x: indeed, a fat node remains 
fat (if x’ and y’ are close enough), and (since any d nodes different from j are 
represented by linearly independent vectors) j is fat in y. This contradicts the 
assumption that x is locally freest. 


178 10. ORTHOGONAL REPRESENTATIONS: DIMENSION 


(c) Clearly x, € Ay. If Ay Z lin(x(S)), then Ay contains a vector y arbitrarily 
close to X,, but not in lin(x(S)). Replacing x, by y we obtain another orthogonal 
representation of G which is strictly freer than x, contradicting the definition of 
locally freest representations. 


(d) For every node wu, |N(u)| < d—1 and the vectors in x(N(w)) are linearly 
dependent by (a), hence dim(A,,) < d—2. Any two representing vectors must be 
linearly independent: if x, = AX,, then by (b), A+ is contained in the linear span of 
X,, which is impossible, since dim(A+) = d—dim(A,,) > 2. Since |N(u)| > n—d> 
2, this shows that dim(A,,) > 2. 


Using these facts about minimally exceptional graphs, we turn to the proof of 
the main theorem. 


Proof of Theorem [10.43} We may assume that G is a minimally exceptional 
graph with an exceptional orthogonal representation x. We may assume that x 
is locally freest. Lemma d) implies that 2 < d—2, which implies that in 
the case d < 3, no exceptional representations exist. Suppose that d = 4, then 
Lemma [10.44{d) implies that G is 3-regular, and the subspaces A, defined above 
have dim(A,,) = 2. Any two vectors x, (u € N(v)) generate A,, and thus the third 
element of x(N(v)) is linearly dependent on them. 

Let uw and v be nonadjacent nodes. Then x,, € lin(x(N(v)\u)) = Av, and so 
Lemma [10.44{c) implies that AL C A,. Since these are 2-dimensional subspaces, 
it follows that A, = A}. Since G is connected, fixing any node v, the rest of the 
nodes fall into two classes: those with A, = A, and those with A, = A}. This 
defines a bipartition V = UUW of G. By this argument, x(U) and x(W) span 
orthogonal 2-dimensional subspaces L and Lt. 

Let us try to replace x; by x;+¢y;, where 

E+, ifieu, 
ans) ee ty ifiewW 
(so the modifying vector y; is orthogonal to x;), and ¢ is sufficiently small. Let us 
assume that the vectors y; satisfy the equations 


(10.34) x}yj;t+xjyi=0 forallije FB. 
Then for every ij € E and real e, we have 
(xi +eyi) "(xy beys) = x1xj Fey; +xjy¥i) te*y Ly; = 0, 

so the modified vectors form an orthogonal representation of G. Choosing ¢ small 
enough, the orthogonal representation x+ey will be arbitrarily close to x, and 
hence it is a locally freest exceptional representation. This implies that sets {x;+ 
ey; : « © U} and {x;+ey; : i © W} span orthogonal 2-dimensional subspaces. 
Thus (x;+ey;)'(x;+ey,;) = 0 for all i € U and j € W (not just for nonadjacent 


pairs). Expanding this equation, we get that (10.34) holds for alli € U and j € W. 
One way to construct vectors y; satisfying (10.33) and (10.34) is the following: 


we take any linear map B: L —-> L+, and define 


Bx; ifieU, 
yi = 


10.35 
( ) —B'x, ifiew. 
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It is easy to check that these vectors satisfy (10.33) and (10.34). We claim that 
this is all that can happen. 


Claim. If the vectors y; € R* satisfy (10.33) and (10.34), then they can be obtained 


as in (10.35) : 


To prove this, consider any linear relation Se y ix; = 0 between the vectors 
in x(U). Since (10.34) holds for all i € U and j € W, it follows that 


S- XY: = > OXY; =0 
icU icU 
for every j € W. Since the vectors in x(W) span L+, this implies that yiey Yi = 
0. So every linear equation satisfied by the vectors x; (7 € U) is also satisfied by the 
vectors y;. This implies by elementary linear algebra that there is a 2 2 matrix 
B for which y; = Bx;. Substituting in (10.34), we get that x/(y;+B'x;) =0. So 
the vector y; +B"x, € L is orthogonal to x; for every i € N(j). This implies that 
Yj + B'x; = 0, proving the Claim. 
Now it is easy to complete the proof. Equations and (10.34), viewed as 
a system of linear equations for the y;, have 4n unknowns, and with 2n+m = 7n/2 
equations, so it has a solution set that is at least n/2-dimensional. By the Claim, 
all these solutions can be represented by (10.35). This gives a 4-dimensional space 
of solutions, son < 8. Thus G is a 3-regular bipartite graph on at most 8 nodes; it 
is easy to see that this means that G = K33 or G = Q3. The former is ruled out 
since G is connected. Thus the only minimally exceptional graph for d = 4 is Q3. 
It follows that every exceptional graph G contains Q? as an induced subgraph. 
Since all degrees of G are at least n—4, it follows that Q? is a component of G. 


10.6.3. More on algebraic properties. The algebraic structure of orthog- 
onal representations has, quite probably, further interesting connections with the 
combinatorial structure of the underlying graph. 

For a subset S C R”, we denote by Z(S) the set of polynomials in n vari- 
ables that vanish on all points of S. This is an ideal, whose algebraic properties 
are closely related to geometric properties of S. We are interested in the ideals 
Ta = T(ORa(G)) and Z', = T(GOR«(G)), both of which consist of polynomials in 
variables «;,, where i € V and r € [d]. 

Theorem Proposition and Theorem can be recast in this 
algebraic language. It follows that the ideal Z/, is a prime ideal if and only if G 
is (n—d)-connected. If G is not (n—d)-connected, then Zz is not prime. If G is 
(n —d)-connected, then for d < 3 the ideal Tq is prime, and this also holds for d = 4 
unless G has a connected component isomorphic to the 3-cube. 

Another interesting ideal is La, generated by the polynomials x}x; = 
a LipLjr, where ij € EF. It is clear that £ C Zq C Ti, for every dimension 
d>1. 

The ideal £; is called an edge ideal, and many properties of it have been studied; 
see e.g. (Morey—Villarreal 2012]. Their results easily imply that £; = Z{, = Z; for 
any graph. The structure of the ideal £2 was studied in [Herzog et al. 2015). 


Among others, they prove that this ideal is radical. 


Exercise 10.1. Which graphs have an orthonormal representation in the plane? 
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Exercise 10.2. Prove the edges of every graph on n nodes can be covered by 
[n?/4| cliques, and show that this is best possible. 


Exercise 10.3. Let G be a triangle-free graph and let u be an orthonormal 
representation of G in R“. Consider the matrix B €¢ RY*" defined by 


Bi; = ujuy, = : # i 
0, ifi=j7. 
Let s, be the sum of k-th powers of the eigenvalues of B, and let a, be the k-th 
elementary symmetric polynomial of these eigenvalues. Prove that (a) s1 = a1 = 
0, (b) a3 = 0, (c) s3 = 0, (d) n < 2d. 
Exercise 10.4. Prove the inequalities (10.22) and (10.21). 


Exercise 10.5. Let © and A be two planes in R? that are neither parallel nor 
weakly orthogonal. Select a unit vector a; uniformly from %, and a unit vector 
bi € Anat. Let the unit vectors az and be be defined similarly, but selecting 
be € A first (uniformly over all unit vectors), and then az from SNbs+. Prove 
that the distributions of (ai, bi) and (ae, b2) are different, but mutually absolutely 
continuous. 


Exercise 10.6. Let u: V > R® be an orthogonal representation of the graph 
G, and let S be a minimal set such that u(S) is linearly dependent. Prove that 
G[S] is connected. 

Exercise 10.7. Prove that for a graph G, the following properties are equivalent: 
(i) G is (n—d)-connected; (ii) there is a G-matrix such that all dx d submatrices 
are nonsingular; (iii) there is a positive semidefinite G-matrix of rank d such that 
all dx d submatrices are nonsingular. 

Exercise 10.8. Let G be a graph and let G’ be obtained from G by creating a 
new node and connecting it to all nodes of G. Prove that wir(G’) = wir(G) +1, 
Wpr(G’) = Wpr(G) +1 and Waig(G’) = Waig(G) +1. 

Exercise 10.9. If x is transversal as a solution of a system of algebraic equations, 
then it also has this property as a solution of any system obtained by dropping 
some of the equations. 

Exercise 10.10. A G-matrix A has the Strong Arnold Property if and only if for 
every symmetric matrix B there exists a G-matrix B’ such that 2'(B’—B)x =0 
for all vectors x in the nullspace of A. 


Exercise 10.11. Let Rx be the manifold of all (not necessarily symmetric) V x V 
matrices with rank k. Determine (a) dimension of Rx, (b) the tangent space of 
Rx, and (c) the normal space Ry at a matrix Ac R*. 

Exercise 10.12. Let G be a graph, and let G = Gi UG2 where |V(G1)N(G2)| = 
1. Prove that Waig(G) = max{Waig(G1), Waig(G2)}. 

Exercise 10.13. Suppose that G has no K, minor. Prove that wp:(G) < 3, and 
if Wpr(G) = 3, then G contains A3 as a minor. 

Exercise 10.14. (a) For every bipartite graph G, the graph GX K2 is a minor 
of GOIC4. (b) Kom41 is a minor of Q?"*1. 

Exercise 10.15. (a) Prove that if a graph G has node-expansion c, then its 


treewidth is at least en/(c+3)—1. (b) Prove that the treewidth of an nxn grid 
graph is n. (c) Prove that the product-width of the n-cube is at least 2”~+. 


Exercise 10.16. Determine all convex polytopes with an edge-transitive isometry 
group for which u'v < 0 for every edge uv. 


Exercise 10.17. Prove that the variety {(u1,u2) : uz € R*, uju2 = 0} is 
indecomposable. 
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Exercise 10.18. Given a graph G, construct a real algebraic variety V that 
consists of general position orthogonal representations, and every general position 
orthogonal representation, after being scaled by a positive scalar, belongs to V. 


CHAPTER 11 


Orthogonal Representations: the Smallest Cone 


In this chapter we generalize the discussion in the introductory Section 
and study orthogonal representations that are “economical” in a sense different 
from dimension. It turns out that the smallest half-angle of a rotational cone 
(in arbitrary dimension) that contains all vectors in an orthogonal representation 
contains interesting information about the graph [Lovasz 1979b]. To be precise, 
we will work with a transformed version of this quantity, the “theta-function”. 
This quantity will be related to the stability number, the clique number and the 
chromatic number of graphs. 

We describe several applications of this graph parameter, of which we mention 
two problems in which this parameter yields the only known solution. One of these 
is the problem of the Shannon capacity (or zero-error capacity) of a graph from 
Section[1.4] which is a tough parameter to compute, and whose evaluation even for 
a small graph like the pentagon needs the theta-function. The other application is 
the computation of the clique number and chromatic number for perfect graphs in 
polynomial time. 


11.1. The theta-function 
Let us start with the definition: 


: 1 
(14) o(G) = minmex ora 


where the minimum is taken over all orthonormal representations (u; : i € V) 
of G and all unit vectors c. We call c the “handle” of the representation (for the 
origin of the name, see Example (1.2). If ¢ is the half-angle of the smallest rota- 
tional cone with axis in the direction of c, containing all vectors in an orthonormal 
representation, then clearly 


1 
(cos 6)? 


Of course, we could fix ¢ to be (say) the standard basis vector e;, but this is 
not always convenient. We may always assume that the dimension d of the ambient 
space is at most n. We could require that cu; > 0 for all nodes, since we can 
replace any u; by its negative. With just a little more complicated argument, we 
may even require that 


(11.2) 0(G) = 


(11.3) clu; = 
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for every node i without changing the optimum value in (11.1). Indeed, let 
aj = 1/(\/0(G)c"u;), then 0 < a; < 1, and so we can replace u; by the (d+n)- 
dimensional vector 


“i= aj Uy 
=| Anae; 
Extending c with n zeroes to get a unit vector c’, we get an orthonormal represen- 
tation in which (c’)'u, = 1/,/0(G) for all i. 
The following rather easy bounds will be very important. 


Theorem 11.1. For every graph G, 


a(G) < ¥(G) < x(@). 


Proof. First, let S C V be a maximum stable set of nodes in G. Then in every 
orthonormal representation (u;), the vectors {u;: i € S} are mutually orthogonal 
unit vectors. Hence 
(11.4) 1 =|c|? > S°(c'u;)? > |S] min(c"u,)”, 

ies / 
and so 


a Ss = ; 
ev. (cTu;)? — Pe CAG) 


This implies the first inequality. The second follows from the orthogonal represen- 
tation constructed in Example using c = (1/,/m)1 as the handle. 


Our examples in Section give further upper bounds on 3. Example 
leads to the following construction: Assuming that there are no isolated nodes, we 
assign to each node i € V the vector u; € R” defined by 


: if e is incident with i 
—_——__ if e is incident with i, 
(ui)e = 4 Vdeg(?) 
0, otherwise, 


and define the handle ¢ = (1/./m)1. Then c'u; = \/deg(i)/m, and so we get the 


bound 
m m 


deg(i) — dmin- 


(The upper bound m/dmin for the stability number a(G) is easy to prove by count- 
ing edges.) From Example we get by elementary trigonometry that 


o(G) < max 


(11.5) O(Cs) < V5. 


Soon we will see that equality holds here. 

It is clear that if G’ is an induced subgraph of G, then 0(G’) < 0(G) (an 
optimal orthonormal representation of G, restricted to V(G’), is an orthonormal 
representation of G’). It is also clear that if G’ is a spanning subgraph of G 
(i.e, V(G’) = V(G) and E(G’) C E), then 0(G’) > 0(G) (every orthonormal 
representation of G’ is an orthonormal representation of G). 
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11.2. Duality for theta 


11.2.1. Alternative definitions. The graph parameter J has many equiv- 
alent definitions. We are going to state some, which lead to an important dual 
formulation of this quantity. 


Vector chromatic number. The following geometric definition was proposed by 
Karger, Motwani and Sudan. In terms of the complementary graph, this value is 
called the “vector chromatic number”. As a motivation for this name, consider a t- 
colorable graph (t > 2), and let us color its nodes by t unit vectors f,,...,f, € R~?, 
pointing to the vertices of a regular simplex centered at the origin. We get a vector 
labeling w (not an orthogonal representation of G, but closely related). It is not 
hard to compute that f/f; = —1/(t—1) for k #1, and so w/w; = —1/(t—1) for 
ij € E. 

Now let us forget about the condition that w; must be one of the vectors 
f,: define a (strict) vector t-coloring (t > 1) of the graph G as a vector labeling 
i++ w; € R” such that |w;| = 1 for all i € V, and w/w; = —1/(t—1) for all 
ij € E. (The dimension n in the definition above is just chosen to be large enough; 
allowing a higher dimension would not make any difference.) 

If G has at least one edge, then the last condition implies that t > 2. The 
smallest ¢ for which the graph G has a vector t-coloring is called its (strict) vector 
chromatic number, and is denoted by Yvec. If E = 0, so G is edgeless and y(G) = 1, 
then we define yyec = 1. For t = 2, the endpoints of any edge must be labeled by 
antipodal unit vectors. It follows that every connected component of G is labeled 
by two antipodal vectors only, and so G is bipartite. Conversely, every bipartite 
graph has a strict vector 2-coloring. 

It is clear from the construction above that Yvec(G) < x(G) for every graph 
G. Equality does not hold in general. Vector-labeling the nodes of a pentagon by 
the vertices of a regular pentagon inscribed in the unit circle, so that the edges are 
mapped onto the diagonals, we see that Yvec(Cs) < V5 < x(Cs) = 3. It is also easy 
to see that Vvec(G) > w(G). 


Semidefinite matrices. Next, we give a couple of formulas for J in terms of 
semidefinite matrices. Let 


11.6 Vdiag = minfl+maxY,;: YeR”’*”, Y>0, Yj; =—-1 (ij EE 
Ss a Vv a 
ve 


and 


(11.7) 


Boum =max{ D> Zi: ZERY*Y, Z~0, Zyj =0 (ij € B), (Z) = 1}. 
ujEV 


It will turn out that these two values are equal. This equality is in fact a special case 
of the Duality Theorem of semidefinite optimization. It is not hard to check that 
(11.6) and are dual semidefinite programs, and the first one has a strictly 
feasible solution. So the Duality Theorem|13.5]of semidefinite programming applies, 
and asserts that the two programs have the same objective value. However, we are 
going to include a proof, to make our treatment self-contained. 
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Dual orthogonal representation. We use orthonormal representations of the 
complementary graph to define 


(11.8) anal = max 5 >(d'v;)?, 

icV 
where the maximum extends over all orthonormal representations (v;: i € V) of 
the complementary graph G and all unit vectors (handles) d. 


11.2.2. The main duality result. The main theorem of this section asserts 
that all these definitions lead to the same value. 


Theorem 11.2. For every graph G, 


v(G) = Xvec(G) = Vaiag (G) = Ysum(G) = Vauai(G). 


Proof. We prove the circle of inequalities 


(11.9) 0(G) < Xvee(G) < Vaiag(G) < Vsum(G) < Vauai(G) < v(G). 


To prove the first inequality, let t = Yvec(G), and let (w; : i € V) be an 
optimal vector t-coloring. Let c be a vector orthogonal to all the w; (we increase 
the dimension of the space if necessary). Let 


Vt t 
Then |u;| = 1 and u/u; = 0 for ij € E, so (u;) is an orthonormal representation of 
G. Furthermore, with handle ¢ we have c'u; = 1/V#, which implies that 0(G) < 
Xvec(G). 

Second, let Y be an optimal solution of (11.6). We may assume that all di- 
agonal entries Y;; are the same number t, since we can replace all of them by the 
largest without violating the other constraints. The matrix 1/(t—1)Y is positive 
semidefinite, and so it can be written as Gram(w; : i € V) with appropriate vec- 
tors (w; € R”). These vectors form a strict vector t-coloring. Since Yyec(G) is the 
smallest t for which this exists, this proves that Yvec(G) < t = Vaiag- 

The main step in the proof is to show that Vajiag < Vsum- Fix any t > Vsum; 
it is easy to see that Vsum > 1 and hence t > 1. Let £ denote the linear space of 
symmetric V x V matrices satisfying Z,; = 0 (ij € EF) and (tl—J)-Z = 0, and let 
P denote the cone of positive semidefinite V x V matrices. 

We claim that PML = {0}. Suppose, to the contrary, that there is a symmetric 
matrix Z #0 such that Z € PNL. Clearly tr(Z) > 0, and so, by scaling, we may 
assume that tr(Z) = 1. Then Z satisfies the conditions in the definition of Usum, 
and so Vsum > J: Z =tl-Z =t, contradicting the choice of t. 

So the linear space £ touches the convex cone P at its apex only, and hence 
there is a hyperplane H such that £ C H and HNP = {0}. Let Y-X = 0 be the 
equation of H (where Y 4 0 is a symmetric V x V matrix); we may assume that 
Y-X > 0 for all X € P. This means that Y is in the dual cone of P. This dual 
cone is P itself, which means that Y > 0. Furthermore, £ C H means that the 
equation of H is a linear combination of the equations defining CL, i.e., there are 
real numbers a,;; (ij € E) and b such that 


ijEk 
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Considering a positive diagonal entry of Y, we see that b > 0, and since we are free 
to scale Y by positive scalars, we may assume that b = 1. But this means that Y 
satisfies the conditions in the definition of Vaiag, and so Vadiag < 1+max; Yi = t. 
Since this holds for every t > Usum, this implies that Vaiag < Usum- 

To prove the fourth inequality in (11.9), let Z be an optimum solution of the 
program with objective function value Jsum. We can write Z as Gram(z;: i € 
V) where z; € R”. Let us rescale the vectors z; to get the unit vectors v; = z? 
(if z; = 0, then we take a unit vector orthogonal to everything else as v;). Define 
=e a. 

By the properties of Z, the vectors v; form an orthonormal representation of 
G, and hence 


Vaal = So(d'v:)?. 
To estimate the right side, we use the equations 
ia? => z= (2) = 1, pee 
and the Cauchy—Schwarz Inequality: 
2 
Sarvs)? = (Sel?) (aT?) = (NH esla"ws) 
272 o 2 
- (Soa'a)* = (aa) = [Eo 
This proves that Vgual > Vsum- 
Finally, to prove the last inequality in (11.9), it suffices to prove that if (u;: i € 


V) is an orthonormal representation of G in R” with handle c, and (v;: i € V) is 
an orthonormal representation of G in R™ with handle d, then 


2 
7 
= s ZL, Lj = s Zig = Ysum, 
ij ij 


2 
= sum: 


(11.10) S(d'vi)? < max = 


The tensor product vectors u;ov; (¢ € V) are mutually orthogonal unit vectors. 
Indeed, (u;ov;)'(ujov,;) = (uju;)(v/v;) = 0, since either u; is orthogonal to u,; 
or v; is orthogonal to v;. Hence 


(11.11) S-(cTuj)?(d"vi)? = S>((cod) (ujov,))” <1. 
On the other hand, 

S-(clui)?(d'v;)? > min(cTu;)? 50 (d'vi)’, 
which implies that 
1 1 


—_——,, = max 
min(c'u,;)? i (c™u,)? 
u 


So(d'vi)? < 


a 


This proves (11.10) and completes the proof of Theorem 
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We can state the theorem more explicitly as the following sequence of formulas. 


e 1 . _ 
(11.12) 3(G) = min {max Cora ye : u ONR of G, |e] = i} 
(11.13) = min{t viii, wie = ee B)} 
’ a"? t—1 

(11.14) = min{1+max Yu : Y +0, Yj =-1 (ij € E)} 
(11.15) = max{ \~ Zig: ZH 0, Ziyj =0 (ij € B), tr(Z) = i} 

i,jEV 
(11.16) = max{ (d's)? : v ONR of G, |d| = i}, 

tEV 


From this form it is clear (and we have seen this in the proof as well) that the 
powerful step in this sequence of formulas is the equality (11.14)=(11.15), where 
an expression as a minimum switches to an expression as a maximum. Note that 
before this equality we have conditions on the edges of G, which then get replaced 
by conditions on the edges of G for the last two rows. 


Remark 11.3. If we 7 To Wane a es upper oe on a(G) than 0(G), we can 
relax the conditions in or (11.14) (11.14) 14). This is equivalent to tightening 
the conditions in the aun 9) problems (11.15) or (11.16) 

McEliece-Rodemich—Rumsey 1978] and 9 to tighten 
the conditions in ae by requiring that Z;; > 0 for all i and : This is equiv- 
alent to relaxing the conditions in (1 (11.14) 14 (11.14) by requiring only Y;; > —1 for ij € E. 
The resulting quantity J~(G) is re a well-characterized (ee polynomial time 
computable) upper bound on a(G), tighter than 0(G). The other characterizations 
of )(G) have natural modifications to yield ¥~(G). 

If we want to get a better bound on x(G), we need to relax the conditions in 
(11.14), or tighten the conditions in (11.15). Se Tt The the quantity 
obtained by adding the conditions A;; > —1 (ij € £) in (11.14). This is equivalent 
to relaxing the conditions in by requiring only 2;; < 0 for ij ¢ E. This 
results in a quantity 0*(G), which is again well-characterized and polynomial time 
computable. 

These quantities satisfy the inequalities 


a(G) <0 (G) < WG) < 0G) < x@). 


The two modified graph parameters lack some of the more algebraic properties of 


0 (for example, Theorem [11.8] 11.8} below), but they give better bounds on a(G) and 
fain 


x(G), respectively. See Exercises 5111.8 for more on these parameters. 


11.2.3. Consequences of duality. From the fact that equality holds in 
(11.9), it follows that equality holds in all of the arguments above. Let us for- 
mulate some consequences. From the fact that equality must hold in (11.11), and 
from the derivation of this inequality, we see that for an optimal orthonormal rep- 
resentation (u,c) and an optimal dual orthonormal representation (v,d), the unit 
vector cod must be a linear combination of the mutually orthogonal unit vectors 
u,ov,;. The coefficients are easy to figure out, and we get 


(11.17) cod= d_(eTus)(d"vi)(uievi), 
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or in a matrix form: 
(11.18) ed’ = 5 °(c'u,)(d'v,)uivy. 

i 
This equation gives (upon multiplying by d from the right and by c!' from left, 
respectively) two equations expressing the handles as linear combinations of the 
(primal and dual) orthonormal representations: 
(11.19) c=) (clu)(d'v;)’u;, d= S$ o(clu;)?(d'v,)vi. 

i 


a 


Using an optimal orthonormal representation with cu; = 1/V0, we get the simpler 
formulas 


Sos yea, Gee ae 
(11.20) = TP se iui, d ne 24 Vi. 


There is a simple relationship between the theta value of a graph and of its 
complement. 


Proposition 11.4. For every graph G, we have 3(G)0(G) > n. 


Proof. Let (u,c) be an optimal orthogonal representation of G. Then applying 
(11.16) to the complementary graph, we get 


v(G) > Dd i(cTu,)? > nmin(cTu;)? = 


n 


a) 


Equality does not hold in in general, but it does when G has a node- 
transitive automorphism group. To prove this, we need an important fact about 
the symmetries of orthogonal representations. We say that an orthonormal repre- 
sentation (u;,c) in R@ of a graph G is automorphism invariant, if every automor- 
phism y € Aut(G) can be lifted to an orthogonal transformation O, of R? such that 
O,c = cand u,(;) = O,u; for every node 7. An optimal orthonormal representation 
(say, in the sense of (11.12)) is not necessarily invariant under automorphisms, but 
there is always one that is (see Figure[11-1p. The representation on the left is also 
optimal with respect to minimizing the dimension, and it is not hard to see that C4 
has no automorphism invariant orthonormal representation in R?. So minimizing 
the dimension and minimizing the cone behave differently from this point of view. 


Ug=U4 


FiIGuRE 11.1. An optimal orthonormal representation of C4 that 
is not invariant under its automorphisms, and one that is. 
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Theorem 11.5. Every graph G has an optimal orthonormal representation and an 
optimal dual orthonormal representation that are both automorphism invariant. 


Proof. We give the proof for the dual orthonormal representation. The opti- 
mum solutions of the semidefinite program in form a bounded convex set, 
which is invariant under the transformations Z ++ P,ZP., where Py, is the permu- 
tation matrix defined by an automorphism a of G. If Z is an optimizer in (11-15), 
then so is P'ZP, for every automorphism a € Aut(G), and hence also 

a 1 T 
Z= Tmat(Gy] Y Fzee 
a€Aut(G) 


This matrix satisfies P'ZP, = Z for all automorphisms a. 

The construction of an orthonormal representation of G in the proof of Vaiag < 
Yeum in Theorem|11.2|can be done in a canonical way: we choose the columns of Z 172 
as the vectors z;, and use them to construct the dual orthonormal representation 
with v; = 2? and d = (5¢,z;)°. The optimal dual orthonormal representation 
constructed this way will be invariant under the automorphism group of G. 


Corollary 11.6. If G has a node-transitive automorphism group, then 
0(G)0(G) =n. 
Proof. It follows from Theorem that G has an orthonormal representation 


(v;,d) in R” such that 5>,(d'v,)? = 0(G), and d'v; is the same for every i. So 
(d'v;)? = 0(G)/n for all nodes i, and hence 


— 1 n 
UG) < ———— = ——. 
ane (d'v,)2 #(G) 
Since we already know the reverse inequality (Proposition (11.4), this proves the 
Corollary. 


Corollary 11.7. If G is a self-complementary graph with a node-transitive auto- 
morphism group, then 0(G) = ./n. In particular, (Cs) = V5. 


A further important feature of the theta-function is its nice behavior with 
respect to graph product; we will see that this is what underlies its applications in 
information theory. 

There are many different ways of multiplying two simple graphs G and H, of 
which we need one in this chapter. The strong product GXH is defined on the 
underlying set V(G) x V(H). Two nodes (ui,v1) and (uz, v2) are adjacent if and 
only if either 17 € E and uv € E(A), or ij € FE andu =v, ori = 7 and w € E(#). 
It is easy to see that this multiplication is associative and commutative (up to 
isomorphism). The product of two complete graphs is a complete graph. 


Theorem 11.8. For any two graphs G and H, we have 0(GRA) = v(G)0(A). 


Proof. Let (u; : i € V) be an optimal orthogonal representation of G 
with handle c (u;,c € R”), and let (vj : j € V(H)) be an optimal orthog- 
onal representation of H with handle d (v;,d € R™). Recalling the identity 


(uov)'(xoy) = (u'x)(v'y) (used in the Introduction), we see that the vectors 
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ujov,; ((i,7) € V(G) x V(#)) form an orthogonal representation of GXH. Fur- 
thermore, taking cod as its handle, we have 


2 2 2 1 1 
((cod) "(u;ov;)) = (c'u,) (dov;)" > HG vay’ 
and hence 
0(GXMA) < max : < 0(G)9(H). 


J ((cod)"(ujov;))” 


To prove that equality holds, we use the duality established in Section 
Let (v;,d) be an orthonormal representation of G which is optimal in the sense 
that 5>,(d'v;)? = 9(G), and let (w;,e) be an orthonormal representation of H 
such that )>,(e'w;)? = 0(H). It is easy to check that the vectors v;ow, form an 
orthonormal representation of GX H, and so using handle doe we get 


O(GBH) > Y*((doe)"(viow;))” = )\(d'v;)?(e™w;)? = 0(G)0(H). 


i,j a,j 


We already know the reverse inequality, which completes the proof. 


11.2.4. Eigenvalues and theta. To motivate the identities and inequalities 
to be proved in this section, let us survey some of the classical results which use 
spectral properties of graphs, or more generally linear algebra techniques, to bound 
quantities like the stability number a = a(G), the clique number w = w(G), or the 
chromatic number x = x(G) in terms of the eigenvalues of the adjacency matrix 
A= Ag. It turns out that several of these results could be used to define 0(G), if 
generalized appropriately. 

Let us start with an almost trivial inequality: 


(11.21) w <1+Amax(A). 
In terms of the complementary graph, 
(11.22) a < 14+ Amax(Aq)- 


Indeed, the matrix A+J contains an w x w submatrix J, of 1’s, so 1+Amax(A) = 
Amax(A+I) > Amax(Jw) = w. Note that in this argument, only the matrix entries in 
adjacent and diagonal positions are used. We could consider any symmetric matrix 
A’ obtained from A by substituting arbitrary real numbers for the remaining entries 
(which are originally zeroes), to minimize the bound Amax(A’)+1. (We keep A’ 
symmetric, to have real eigenvalues.) What is the best bound on w we can obtain 
this way? 

The following lower bound on the chromatic number of a graph 
is more difficult to prove: 


Amax (A) 


11.2 Sf 
(11.23) eae Yonin(A) 


(note that Amin(A) < 0 if G has at least one edge, which we may assume). We will 
not go through the proof; but if you do, you realize that it uses only the 0’s in the 
adjacency matrix, so we can play with the 1’s to get the sharpest possible lower 
bound. What is the best bound on y we can obtain this way? 
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Hoffman (unpublished) proved the following upper bound on a, somewhat anal- 

ogous to the bound (11.23): If G is a d-regular graph, then 
—NXmin(A) —NXmin(A) 

hey ae 7 d— Avia A) 7 Amax(A) =Amia(A) ; 
Looking at the proof, one realizes that we use here where the 0’s of A are, and also 
the fact that all row sums are the same; but not the actual values of the entries 
corresponding to edges. What is the best bound we can obtain by playing with the 
entries in adjacent positions? 

Perhaps it is not surprising that the answer to the first two questions posed 
above is 0(G), and the answer to the third is closely related. This will fol- 
low from the next identities. The first and third formulas below were proved in 


Lovasz 1979b), the second, in [Knuth 1994). 
Proposition 11.9. For every graph G, 
(11.25) 0G) = min Amax(U), 


where U ranges over all V x V-matrices with Uj; =1 for ij € E and also for i = j. 
Furthermore, 


(11.26) 0G) = max Amax(W), 


where W ranges over all positive semidefinite V x V-matrices with Wi; = 0 for 
ij € FE andW;; =1 fori eV. Furthermore, 


(11.27) OG) = Tmax Sanco 


where T ranges over all symmetric nonzero V x V-matrices with T;; = 0 for ij € E 
and also fori =. 


Note that (11.26) can be written as 
(11.28) 0(G) = max Amax(Gram(v)), 


where v ranges over dual orthonormal representations of G (see Exercise |11.12} for 
the connection of this formula with the definition |11.16] of ¥). 


Proof. Let Y be a minimizer matrix in (11.14), let D be the diagonal matrix 
obtained from Y by changing all off-diagonal entries to 0, and define U = I+D—-Y. 
Then U satisfies the conditions in the Proposition, and (using that Y > 0) 


Amax(U) < Amax (I+ D) = 1+max Yi = HG). 


The reverse inequality follows similarly, by starting with a minimizer in (11.25), 
and considering Y = Amax(U)I—U. 

To prove (11.26), let Z be a minimizer in (11.15), and define a V x V matrix 
B by 


Zij 
W,; = 


Then W satisfies the conditions in (11.26). Define a vector « € RY by 2; = VZi, 
then x is a unit vector and 
Amax(B) > «'Wa = S 4ij3 = v(G). 


tJ 
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The reverse inequality follows similarly, by starting with an optimizer W in (11.26), 
and scaling its rows and columns by the entries of an eigenvector belonging to 
Amax(W). 
. . . . . . _ 1 
Minally, to prove (11.27), consider an optimizer T in it, then W = I— Tort) 
is positive semidefinite, has 0’s in adjacent positions and 1’s on the diagonal. Hence 


by (11.26), we have 


Nmiax (2) 
Amin (T) ; 
The reverse inequality follows by a similar argument. 


O(G) > Amax(W) = 1- 


Proposition[I1.9]can be combined with different known estimates for the largest 
eigenvalue of a matrix. As an example, using that the largest eigenvalue of a matrix 
is bounded above by the largest @;-norm of its rows, we get that for every optimal 
dual orthogonal representation v, 


(11.29) 0(G) < max S° |v vj]. 
j 


Finally, we show how a strengthening of Hoffman’s bound (11.24) can be derived 
from Proposition 
Proposition 11.10. Let G be ad-regular graph. Then for every symmetric nonzero 


V x V-matrit M such that M,; = 0 for ij € E and also for i = j, and M has equal 
row-sums, we have 


(2 — a, 

Amax(M) = Amin(M) 
If the automorphism group of G is transitive on the nodes, then there is such a 
matrix M attaining equality. If the automorphism group is transitive on the edges, 
then equality holds for M = Ag. 


Proof. Let M have eigenvalues \; = d > Ag >-:: > Ay. The matrix J—tM 
satisfies the conditions in for every value of t. Using the condition that all 
row-sums of M are the same, we see that 1 is a common eigenvector of J and M, 
and it follows that all eigenvectors of M are eigenvectors of J as well. Hence the 
eigenvalues of J—tM are n—td, —trA2,...,—-tAn. From the fact that the trace of 
M is zero, and hence ,, < 0, it follows that the largest eigenvalue of J—tM is 
either n—td or —tA,,, and we get the best bound if we choose t so that these two 
are equal: t = n/(d—A,), giving the bound in the Proposition. 

We can see just as in the proof of Theorem that there is an optimizing 
matrix U in that is invariant under the automorphisms. So if G has a 
node-transitive automorphism group, then the row-sums of this matrix are equal, 
and the same holds for M = J—U. This matrix M satisfies the conditions in the 
Proposition, and attains equality. 

If G has an edge-transitive automorphism group, then all nonzero entries of MZ 
are the same, and hence M = tAg for some t 4 0. The value of t cancels from the 
formula, so M = Ag also provides equality. 


11.3. Computing the theta-function 


Perhaps the most important consequence of the formulas proved in 


Section is that the value of W(G) is polynomial time computable 
Grotschel—Lovasz—Schrijver 1981]. More precisely, 
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Theorem 11.11. There is an algorithm that computes, for every graph G and 
every € > 0, a real number t such that 


|\W(G)-t| <e. 
The running time of the algorithm is polynomial in n and log(1/e). 


Algorithms proving this theorem can be based on almost any of our formulas 
for ¥. The simplest is to refer to Theorem [11.2] giving a formulation of 0(G) as the 
optimum of a semidefinite program (11.6), and the polynomial time solvability of 
semidefinite programs (see Chapter |13). 

The significance of this fact is underlined if we combine it with Theorem [11.1] 
The two important graph parameters a(G) and y(G) are both NP-hard, but they 
have a polynomial time computable quantity sandwiched between them. 

Computability of the theta-function in an approximate sense, with an arbitrary 
small error, is an important fact, but not always satisfactory. Often we want explicit 
algebraic expressions, or at least explicit bounds. For example, it is important to 
know that 0(Cs) = V5, not just that it is between 2.23606 and 2.23607. In the 
rest of this section we compute the theta-function of several classes of graphs, to 
illustrate the use of the formulas in the previous sections. 


Example 11.12 (Cycles). Even cycles are trivial: If n is even, then a(C,) = 
0(Cn) = x(Cn) = n/2 and a(Cn) = 8(Cn) = x(Cn) = 2. To derive the theta- 
function on odd cycles, we can use Proposition [11.10 The eigenvalues of Cy, are 
2cos(2km/n) (k =0,1,...,n—1), of which k = 0 gives the largest (which is 2) and 
k = (n—1)/2 gives the smallest one (which is 2cos((n—1)m/n) = —2cos(m/n). 
Hence 


WC.) = ncos(7/n) 
1+cos(m/n) 
Since C, has a node-transitive automorphism group, this implies that 
O(Cn) =1+ — 
cos(7/n) 
In particular, 0(C,,) + 2 as n > oo. 4 


Example 11.13 (Kneser graphs). The Kneser graph K7 is defined on node set 
(nl), by connecting two k-sets if and only if they are disjoint (1 < k <n). Let us 
assume that n > 2k to exclude the trivial case of a graph with no edges. The set 
of k-sets containing any fixed element of [n] is stable, hence a(K;?) > Ga): The 
Erdés—Ko-Rado Theorem asserts that this is the exact value; this fact will follow 
from our considerations below. 

To compute the theta-function of this graph, we use the eigenvalues of its 
adjacency matrix. These are well known from coding theory: 


(11.30) wa"), (¢=0,1,...58): 


The multiplicity of eigenvalue 2, is (7) — (,", 
The largest eigenvalue in CS) (the degree of each node), while the smallest is the 
n—k-1 
ei): 


) (but this will not be important here). 


next one, —( 
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We apply the formula in Proposition }11.10} and get 


wee n-1 
(11.31) (Ky) < makin = ts 


Comparing this upper bound with the lower bound on a(K*), we see that they are 
equal, and so 


(11.32) 8(K®) = a(K?) = oa 


In particular, the Petersen graph K has J(K3) = 4. 
Since A; has a node-transitive automorphism group, it follows that 


son n n—-1 n 
(Ky) = (7) / =". 
(Ae) 6 & i) k 
This is quite close to a(K;,) = [n/k], but can be arbitrarily far from the chromatic 
number of K;’, which is known to be n—2k+2. 4 


Example 11.14 (Paley graphs). The Paley graph Pal, is defined for a prime p = 1 
(mod 4). We take the {0,1,...,p—1} as nodes, and connect two of them if their 
difference is a quadratic residue. It is clear that these graphs have a node-transitive 
automorphism group, and it is easy to see that they are self-complementary. So 
Corollary applies, and gives that v(Pal,) = \/p. To determine the stability 
number of Paley graphs is an unsolved number-theoretic problem; it is conjectured 
that a(Pal,,) = O((log p)?). ¢ 


Example 11.15 (Cycles with diagonals). For graphs with a node-transitive 
automorphism group, the existence of automorphism-invariant optima can be very 
useful. We illustrate this on the graph W,, obtained of an even cycle C, with its 
longest diagonals added. The graph Wg is just the Kuratowski graph K33, Ws 
is the Wagner graph from Example In fact, we can restrict our attention 
to the case when n = 4k is a multiple of 4, since otherwise W,, is bipartite and 
o(W,) = a(W,,) = n/2. We can observe the easy bounds 2k—1 < 0(W,,) < 2k, 
since W,, has 2k—1 nonadjacent nodes and can be covered by 2k edges. 

Let V(W,,) = {0,1,...,n—1}, where the nodes are labeled in the order of the 
original cycle. There exists an optimizing matrix U in that is invariant 
under rotation, which means that it has only three different entries: 


l+a, ifj-7t=+1 (mod 4k), 
Uiy = 414+), if j7-t=2k (mod 4k), 
1, otherwise. 


It is easy to see that for every (4k)-th root of unity ¢, the vector 
(1,¢,e7,...e"~') is an eigenvector of U. (This is a complex vector, so if we want 
to stay in the real field, we have to consider its real and imaginary parts; but it is 
more convenient here to do computations with complex vectors.) The eigenvalue 
X, corresponding to ¢ = e?7’"/" is easy to compute: 


Xo = 4k+2a+b, 
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and 


i 


Ae = ltaete? +--+. be 4678114... +ae""! = ale +e) +be* 


rt 
=) —+b(-1)" > 0). 
acos 5 +b(-1) (r > 0) 
We want to minimize max, \,. Each \, = 4,(a,b) is a linear function of a and b, 
so we can find J as the optimum of the linear program in 3 variables a, b and t: 


(11.33) J(W,,) = min max A,(a, b) = 


a,b Tr 


minimize t 

subject to A,(a,b) <t, r=1,...,n—1. 
There are many ways to do a back-of-the-envelope computation here; one gets that 
a = —k, b = —k+kcos(7/k) is an optimal solution, giving 


(11.34) 0(Wp) = k-+keos =. 


The main point in this example is to illustrate that for graphs with a node-transitive 
automorphism group, the value of the theta-function can be computed by a linear 
program, analogous to (11.33), where the number of unknowns is the number of 
orbits of the automorphism group on the edges. This may or may not lead to 
simple formula like in this case, but the computation is easy to perform, often even 
by hand. 4 


Example 11.16 (Self-polar polytopes). A polytope P C R? is called self-polar, 
if P* = —P. Note that this condition implies that for each vertex v, the inequality 
(—v)'x < 1 defines a facet Fy of P, and we obtain all facets this way. We call 
two vertices v and v’ of P opposite, if v’ lies on Fy. In other words, v'v’ = —1, 
which shows that this is a symmetric relation. We call the polytope strongly self- 
polar, if it is inscribed in a ball centered at the origin, in other words, there is 
an r > 0 such that |v| = r for all vertices v. For two opposite vertices we have 
1 = v'u < |v||u| = r?, and hence r > 1. It also follows that the distance of 
any facet from the origin is 1/r, so the sphere with radius 1/r about the origin is 
contained in P and touches every facet. 

In dimension 2, regular polygons with an odd number of vertices, with appro- 
priate edge length, are strongly self-polar. It was proved in that. for 
every dimension d and € > 0 there exist strongly self-polar polytopes inscribed in 
a sphere with radius r < 1+e. 

Let P be a strongly self-polar polytope in R%, and let G be the graph on 
V = V(P), in which two vertices are connected if and only of they are opposite. It 
was proved in that x(G) > d+1. 


We can estimate J(G) and 0(G) as follows. Let us label each vertex v of P 


with the vector 
_ 1 Vv 
VO fr? 41 \1) 


This is trivially a unit vector in R¢*+!, and uy | uy, for opposite vertices v and v’. 
So u is an orthonormal representation of G. Using the vector eg;; as handle, we 
see that 


0(G) < max : 


2 
a 
vo (ej41Uv)? 
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and 
n 


v(G) = D (eka)? = 741" 


In particular, we see that y(G) can be arbitrarily large while 0(G) can be arbitrarily 
close to 2. rY 


Example 11.17 (Random graphs). Consider the most basic random graph 
G(n, 1/2) (as usual, G(n,p) denotes a random graph on n nodes with edge prob- 
ability p). It is a nontrivial problem to determine the theta-function of a ran- 
dom graph, and not completely solved. To start with a heuristic, recall Corollary 
a selfcomplementary graph G with a node-transitive automorphism group 
has ¥(G) = /n. As a special case, the Paley graph Pal, is quasirandom (infor- 
mally, it behaves like a random graph in many respects) with edge-density 1/2 and 
0(Pal,) = ./p. 

The graph G(n,1/2) is, of course, not self-complementary, and its automor- 
phism group is trivial, with high probability. However, its distribution is invariant 
under complementation and also under all permutations of the nodes. Informally, it 
is difficult to distinguish it from its complement (as it is difficult to distinguish any 
two random graphs with the same edge-density), and apart from a little variance in 
the degrees, it is difficult to distinguish any two nodes. So perhaps it does behave 
like a self-complementary graph with a node-transitive automorphism group would! 

This heuristic predicts the right order of magnitude of )(G(n, 1/2)), namely it 
is of the order ,/n. However, no proof is known that would build on the heuristic 
above. It was proved in that with high probability, 


(11.35) svn < v(G(n, )) 23a, 


The method extends to estimating the theta-function for random graphs with 
other edge-densities. If p is a constant and n > oo, then with high probability, 


Ves < v(G(n,p)) <3 wee 


The analysis can be extended to the case when (Inn)!/°/n < p < 1—(Inn)'/°/n 


Coja-Oghlan—Taraz 2004] (see [Coja-Oghlan 2003] for more results about the con- 


centration of this value). 
We sketch the proof in the case p = 1/2. First, consider the upper bound. The 
proof of Juhasz refers to a result of |Furedi-Komlds 1981) bounding the eigenvalues 


of random matrices. Let A be the matrix defined by 


-1, ifij cE, 
Aij = . 
ales otherwise. 

Note that E(A;,;) = 0 and A}, =1fori #7. The matrix A satisfies the conditions in 
Proposition|11.9} and hence the results of [Firedi-Komldés 1981] provide the bound 
8(G) < Amax(A) < 3VA. 

To prove the lower bound, it suffices to invoke Lemma and apply the upper 
bound to the complementary graph: 


¥(G(n, 3)) = 0(G(n,3)) = ne md) > yg vn 


198 11. ORTHOGONAL REPRESENTATIONS: THE SMALLEST CONE 
with high probability. 4 


11.4. Stable sets 


11.4.1. Stability number and theta. We have introduced 3(G) as an upper 
bound on the stability number a(G). How good an approximation of the stability 
number is obtained this way? We have seen in Theorem [11.1] that 

a(G) < W(G) < x(G). 
But a(G) and x(G) can be very far apart, and unfortunately, the approximation of 
a by ¥ can be quite as poor. We have seen that for a random graph G with edge 
density 1/2, we have a(G) = O(Inn), but 0(G) = O(./n) (with high probability 
as n —+ co). Even worse examples can be constructed [Feige 1995]: sequences of 
graphs G for which a(G@) = n° and 0(G) = n!~°); in other words, 0/a can be 
larger than n!~* for every ¢ > 0. (The existence of such graphs also follows from 
the P 4 NP hypothesis and the results of [Hastad 1999], asserting that it is NP- 


hard to determine a(G) with a relative error less than n'~*.) By [Szegedy 1994], 


this also implies that 0(G) does not approximate the chromatic number within a 
factor of n1—€. 

These constructions do leave a little room for something interesting, namely in 
the cases when either a is very small, or if 0 is very large. There are indeed (rather 
weak, but useful) results in both cases. 


First, consider the case when a is very small. proved that 
if a(G) = 2 then 0(G) = O(n'/3), and constructed a graph with a(G) = 2 and 
0(G) = Q(n°) for c > 0. The matching lower bound of Q(n‘/*) was proved by Alon 
[Alon 1994], improving a slightly weaker lower bound of [Kashin—Konyagin 1981]). 


(The construction, which is quite involved, is not reproduced here.) For general 


a(G), [Alon—Kahale 1998] proved the following theorem: 
Theorem 11.18. Let G be a graph with n nodes and a(G) =k. Then 
0(G) < 16n**. 


To prove this theorem, we need a lemma that facilitates recurrence when bound- 
ing the theta-function. 


Lemma 11.19. Let G be a graph, and fori € V, let G; = G[N(i)]. Then 


a(G) <1+max ViN@| 9G. 
ve 
Proof. Let (vi,d) be an optimal dual orthonormal representation of G. By 
inequality (11.29), 
0(G) < max 5° |viv;|=1+max S> |v/vjl. 
J jE N(i) 
For any given 7, we use the Cauchy—Schwarz Inequality: 
20 _ 
(3° Wivil) <N@L YO wy)? < NOWG), 
jEN(i) jEN(i) 
since we can consider (v; : j € N(i)) as a dual orthonormal representation of G; 
with handle v;. Combining these two inequalities, we get the Lemma. 
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Proof of Theorem {11.18} The case k = 1 is trivial, so assume that k > 1. 
We use induction on the number of nodes. Let A = max; |N(i)| and G; = G[N(i)], 
then a(G;) < k—1 for every node i, and hence 0(G;) < 16A‘*-2)/*, So Lemma 


11.19]implies that 


0G) <1+max 1/|N(i)| 0(G;) <14+.VA-(16A***) = 144A. 


If A < 3nk/('+1) | then 
k-1 k-1 k k-1 k-1 k-1 
144A*8* < 144.3" nee <1412nFt < 16nFt, 
and we are done. If A > 3n*/(*+1), then let i be a node with |N(i)| = A, and 
consider the partition of the nodes into the sets 9; = N(i) and 92 = {i}UN(t). 
Note that a(G[S2]) < a(G)—1=k-1. So by induction on the number of nodes, 
8(G) < B(G[S,]) + 0(G[S2]) < 16A** +16(n— A) et 


k-1 
k+1 


_ = A Z = 
= 16A*F* + 16n*# (1- =) < 16A*F* +16 (1-7 
n 


k-1 2 
= 16nk +16A( ~k———_ 7 rir) 
+1 
ka ee 2 k-1 ko 
< 16n## +16An- (s-t--—) <16n i. 
k+1 


To motivate the next theorem, let us think of the case when the chromatic 
number is small, say x(G) < t, where we think of t as a constant. Trivially, a(G) > 
n/t, and nothing better can be said. From the weaker assumption that w(G) < t 
we get a much weaker bound: using the result of 
in the theory of off-diagonal Ramsey numbers, one gets (essentially) a power of n 
as a lower bound: 


(11.36) a(G) =2 (4°) 


The inequality 0(G) < t is a condition that is between the previous two, and the 
following theorem of SS a es does give a lower bound on 
a(G) that is better than (11.36) (for ¢ > 2). In addition, the proof provides a 
polynomial time randomized algorithm to construct a stable set of the appropriate 
size, whose use we will explain later. 


Theorem 11.20. Let G be a graph and t = 3(G). Then 
3 
ntti 
a(G) > ——. 
(2 10Vinn 


Proof. The case t = 2 is easy (cf. Exercise|11.1), so we assume that t > 2. By 
Theorem [IT.2} t is the strict vector chromatic number of G, and hence there are 


unit vectors u; € R* (in some dimension k) such that uJu; = —1/(t—1) whenever 
ie BE. 
Let w be a random point in R¢ whose coordinates are independent standard 


Gaussian random variables. Fix an s > 0, and consider the set S$ = {i: w'u; > 
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s}. The inner product w'u; has standard Gaussian distribution, and hence the 
probability that a given node belongs to S' is 


Qs) =e f eae, 


and the expected size of S is E|S| = Q(s)n. 

Next we show that S does not induce too many edges. Let 77 € EF, then 
Ju;+u,|? = (2t—4)/(t-1), and so the probability that both nodes u; and u; 
belong to S can be estimated as follows: 


2t—2 
P(wlu; > s, w'u; >s) < P(w'(u;t+u;) > 2s) = Q( 3 s). 


Hence the expected number of edges spanned by S' satisfies 


E|EIS]| < @( a )m, 


where m = |E|. We can delete at most |£[S]| nodes from S$ (one from each edge it 
induces) to get a stable set T with expected size 


E|T| > E(|S|—|E[S]|) = E|S|—E|E[S]| > Q(9)n—0(\[2s)m 


We want to choose s so that it maximizes the right hand side. By elementary 
computation we get that 


2t-—4, ™m 
s = 4/—lIn— 
t n 
is an approximately optimal choice. Using the well known estimates 
1 1 2 1 Ss 2 
11.37 se ASO 2 ee, 
(11.37) = NaS Fea 
we get 
1 t— t— 
a(G) > ig 8 
10\/In(m/n) 


If m < n@t+)/@¢+) | then this proves the theorem. 

If m > n@t+)/(+), then there is a node i with degree A > 2m/n > 2n*/(+1), 
Clearly )(G[N(i)]) < t—1 (see Baerioe[ LB ond hence by induction on n, G/N (¢) 
has a stable set of size at least A?/+/(10VIn A) > n3/+) /(10V/Inn). This proves 
the theorem in this case as well. 


11.4.2. The stable set polytope. Stable sets and cliques give rise to impor- 
tant polyhedra associated with graphs. After summarizing some basic properties 
of these polyhedra, we show that orthogonal representations provide an interesting 
related convex body, with nice duality properties. 

The stable set polytope STAB(G) of a graph G is the convex hull of incidence 
vectors of all stable sets. This gives us a polytope in RY. The stability number 
a(G) can be obtained by maximizing the linear function }),-y 7; over this polytope, 
which suggests that methods from linear programming can be used here. 

With this goal in mind, we have to find a system of linear inequalities whose 
solution set is exactly the polytope STAB(G). It would be best to find a minimal 
such system, which is unique. If we can find this system, then the task of computing 
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the stability number a(G) of G reduces to maximizing }7,-y 7; subject to these 
constraints, which means solving a linear program. Unfortunately, this system of 
linear inequalities is in general exponentially large and very complicated. But if we 
find at least some linear inequalities valid for the stable set polytope, then solving 
the linear program we get an upper bound on a(G), and for special graphs, we get 
the exact value. 

So we want to find linear inequalities (constraints) valid for the incidence vector 
of every stable set. We start with the trivial nonnegativity constraints: 


(11.38) a>0 (EV). 
The fact that the set is stable is reflected by the edge constraints: 
(11.39) Ui+X; <1 (ij E E). 


Inequalities (11.38) and (11.39) define the fractional stable set polytope FSTAB(G). 
Integral points in FSTAB(G) are exactly the incidence vectors of stable sets, but 


FSTAB(G) may have other (nonintegral) vertices, and is in general larger than 
STAB(G) (cf. Exercise|11.17). The case of equality has a nice characterization. 


Proposition 11.21. STAB(G) = FSTAB(G) if and only if G is bipartite. 


Let af(G) denote the maximum of )>,2; over x € FSTAB(G).  Trivially 
a(G) < af(G), and a/(G) is computable in polynomial time (since and 
describe a linear program defining a/). The difference af —a will play a 
role in Chapter [18] 

We can strengthen the edge constraints if the graph has larger cliques. Every 
clique B gives rise to a clique constraint: 


(11.40) Slee. 

i€B 
Inequalities (11.38) and define a polytope QSTAB(G), the clique- 
constrained fractional stable set polytope of G. Since cliques in G correspond to 
stable sets in G and vice versa, it is easy to see that QSTAB(G) is just the an- 
tiblocker of STAB(G) (cf. Section |C.3). 

Again, we can introduce a corresponding relaxation of the stability number, 
namely the quantity a*(G) defined as the maximum of )7, x; over x € QSTAB(G). 
This quantity is a sharper upper bound on a(G) then af, but it is NP-hard to 
compute [Grotschel-LovaszSchrijver 1984]. See also Exercise [11.19] 

The polytope QSTAB(G) is contained in FSTAB(G), but is still larger than 
STAB(G) in general. The case of equality leads us to an interesting and rich class 
of graphs, of which we give a very brief survey. 


11.4.3. Perfect graphs. Recall again the basic inequality 


a(G@) < WG) < x(@). 


For graphs with a(G) = x(G), we have equality here, so ? is an integer. To know 
this is useful; just to mention one consequence, calling the approximation algorithm 
in Theorem [11.11] with an error bound of 1/3 gives the exact value of V. 

But which graphs have this nice property? It turns out that the condition 
a(G) = x(G) does not say much about the structure of G, but a strengthened 
version of it leads to a very interesting class of graphs. A graph G is called perfect, 


if for every induced subgraph G’ of G, we have w(G’) = x(G’). Every bipartite 
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graph is perfect, since they satisfy w(G) = x(G) = 2 (if they have an edge) or 
w(G) = x(G) = 1 (if they have no edge), and their induced subgraphs are also 
bipartite. Figure shows some perfect and nonperfect graphs. 


Xe 


FIGURE 11.2. Some perfect graphs (first row) and some nonperfect 
graphs (second row). 


To be perfect is a rather strong structural property; nevertheless, many inter- 
esting classes of graphs are perfect (bipartite graphs, their complements and their 
linegraphs, interval graphs, comparability and incomparability graphs of posets, 
chordal graphs, split graphs, etc.). 

The following deep characterization of perfect graphs was conjectured by 
Berge in 1961 and proved by Chudnovski, Robertson, Seymour and Thomas 


Chudnovsky et al. 2006}. 


Theorem 11.22 (Strong Perfect Graph Theorem). A graph is perfect if and 
only if neither the graph nor its complement contains a chordless odd cycle longer 
than 3. 


As a corollary we can state the “Weak Perfect Graph Theorem” proved earlier 


Lovasz 1972): 


Theorem 11.23. The complement of a perfect graph is perfect. 


From this theorem it follows that in the definition of perfect graphs we could 
replace the equation w(G’) = y(G’) by a(G’) = x(G’). Perfectness can also be 
characterized in terms of the stable set polytope |Chvatal 1975]: 


Theorem 11.24. STAB(G) = QSTAB(G) if and only if G is perfect. 
Based on our remark above, the condition STAB(G) = QSTAB(G) is equiva- 


lent to saying that STAB(G) and STAB(G) are antiblockers, which is a condition 
symmetric in G and G. So Theorem|11.24|implies Theorem |11.23 


Turning to algorithms, Theorem|11.11}implies: 


Corollary 11.25. The stability number and the chromatic number of a perfect 
graph are polynomial time computable. 


Using the algorithms of Corollary|11.25}one can compute more than just these 
values: one can compute a maximum stable set and an optimal coloring in a perfect 


graph in polynomial time (see Exercise }11.18). 
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It is interesting (and somewhat frustrating) that the only other way of comput- 
ing the stability number of a perfect graph in polynomial time is to use the very ad- 
vanced structure theory of perfect graphs, developed more recently by Chudnovski, 
Robertson, Seymour and Thomas {Chudnovsky et al. 2006]. This deep theory is 
combinatorial, but quite involved. 

Theorem[I1.11]extends to the weighted version of the theta-function. Maximiz- 
ing a linear function over STAB(G) or QSTAB(G) is NP-hard; but, surprisingly, 
TSTAB behaves much better: Every linear objective function can be maximized 
over TSTAB(G) (with an arbitrarily small error) in polynomial time. This ap- 
plies in particular to 0(G), which is the maximum of 5°, z; over TSTAB(G). See 


Grotschel—Lovasz—Schrijver 1988} |Knuth 1994] for more detail. 


11.4.4. Orthogonality constraints. For every orthonormal representation 
(u;,c) of G, we consider the linear constraint 


(11.41) So (ctuj)?2; <1, 
ieV 

which we call an orthogonality constraint. The solution set of nonnegativity and 
orthogonality constraints is denoted by TSTAB(G). It is clear that TSTAB is 
a closed, full-dimensional, convex set. The orthogonality constraints are valid if 
x is the indicator vector of a stable set of nodes (cf. (11.4)), and therefore they 
are valid for STAB(G). Furthermore, every clique constraint is an orthogonality 
constraint. Indeed, for every clique B, the constraint >),-, 7; < 1 is obtained from 
the orthogonal representation 


i ie B, 

Lb . c=e,. 
e;, otherwise, 

Hence 

(11.42) STAB(G) C TSTAB(G) C QSTAB(G) 


for every graph G. 

There are several other characterizations of TSTAB. These are based on an 
extension of the theta-function to the case when we are also given a weighting 
w: V—+R4. Generalizing the formulas (11.12)-(11.16), the quantity 0(G, w) can 


be defined by any of the following formulas |Grotschel—Lovasz—Schrijver 1986}: 
: u ONR of G, |e] = 1} 


‘ Wi 
(11.43) J(G,w)= min{ max (ctu)? 


(11.44) = min{t >2:ly/? =t—w,, yly; =— wry (ij € B)} 
(11.45) 7 min{max(Yis + wi) : Y=0, Viz =—/wiw; (ij € E)} 
(11.46) = max{ S> wiwjZij: Z>0, Ziy =0 (if € BE), Yo Zu = i} 
igjeV i 
(11.47) = max{ )> w;(d"v;)? : v ONR of G, |d| = i. 
iEV 


The equivalence of (11.43)—(11.47) can be obtained extending the proof Theorem 
11.2|to the node-weighted version (at the cost of a little more computation), which is 
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described in the book |Grotschel—Lovasz—Schrijver 1988], and we do not reproduce 


the proof here. 


Similarly as in the unweighted case, the optima in (11.43) and (11.47) satisfy 
the relations 
(11.48) cd’ = 5 °(c'u,)(d'vi)uiv; 

i 

and its consequences formulated there. (The node-weights don’t enter this relation.) 

For every orthonormal representation u = (u;: i € V) with handle c, we call 
the vector ((cTu;)? 2 te V) the profile of the representation. We can state two 
further characterizations of TSTAB(G): 


Proposition 11.26. (a) zc € TSTAB(G) if and only if 0(G,x) < 1. 
(b) The body TSTAB(G) is exactly the set of profiles of dual orthonormal rep- 
resentations of G. 


Proof. (a) follows from (11.47). 


(b) The profile of every dual orthonormal representation belongs to TSTAB(G); 
this is equivalent to (11.11). Conversely, let 2 € TSTAB(G). Then 0(G, x) < 1 by 
(a), so implies that there is a dual orthonormal representation v of G with 
handle d for which x; < d'v; for all nodes i € V. Thus the vectors vi = (a; / d'v;)v; 
satisfy d'v/ = a;. The vectors v/ are not of unit length, but the vectors 


v. d 
My uv d” = 
“= (tape) a= (9) 


form a dual orthonormal representation of G with profile x. 


The last characterization of TSTAB(G) is equivalent to the following duality 
result. 


Corollary 11.27. TSTAB(G) is the antiblocker of TSTAB(G). 


Next we determine the vertices of TSTAB(G). Recall that a vertex of a con- 
vex body K is a boundary point v that is the unique point of intersection of all 
hyperplanes supporting K at v. This means that there is a pointed convex cone 
containing K with v as its vertex. This is to be distinguished from an extreme 
point, which is the unique point of intersection of a hyperplane supporting K with 
K. 


Theorem 11.28. The vertices of TSTAB(G) are exactly the incidence vectors of 
stable sets in G. 


Does this imply that TSTAB(G) = STAB(G)? Of course not, since TSTAB(G) 
(as every convex body) is the convex hull of its extreme points, but not necessarily 
of its vertices. 


Proof. The vector 14, where A is a stable set of nodes, is the unique common 
point of n supporting hyperplanes 2; = 1 (i € A) and 2; = 0 (i € V\A), and so it 
is a vertex of TSTAB(G). 

Conversely, let z = (z; : i € V) be a vertex of TSTAB(G). If z; = 0 for 
some node 7, then we can delete i: We get a graph G’ for which TSTAB(G’) = 
TSTAB(G)N{z; = O}, and so z|y\; is a vertex of TSTAB(G"), and we can proceed 
by induction. 
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So we may assume that z; > 0 for alli € V. Since z € TSTAB(G), we can 
write z; = (d'v;)? for some dual orthonormal representation (v;: i € V) of G and 
unit vector d. 

Let a'x < 1 be a hyperplane that supports TSTAB(G) at z. Then a € 
TSTAB(G) by Corollary [11.27 and hence there is an orthonormal representation 


(u;: 7 € V) of G and unit vector c such that a; = (c'u;)?. From (11.48), we get 
d= S-(clu)’(d'vi)vi. 
Multiplying by any vector y, we get 


d'y = S-(ctui)?(d"vi)(viy) = S- a;(d'v;)(v/y). 
7 a 

Thus if y is not orthogonal to d, then the point z’ defined by z = 
(d'v;)(v/y)/(d'y) is contained in the supporting hyperplane )>,a;2; = 1. This 
holds for every supporting hyperplane at z. Since z is a vertex, the only com- 
mon point of hyperplanes supporting TSTAB(G) at z is z itself (here we use 
that z is a vertex, not just an extreme point). Thus 2’ = z, which means that 
viy = (d'v;)(d'y) for all i (here we use that z; 4 0 for all i). This relation holds 
for almost all y, which implies that v; = (d'v;)d, and since v; is a unit vector, we 
get v; = d. So no two vectors v; are orthogonal, and thus G has no edges. But 
then z = ly is the incidence vector of a stable set as claimed. 


Example 11.29. Consider the graph Cs, with node set {1,...,5}. The polytope 
STAB(Cs) has 11 vertices (the origin, the basic unit vectors, and the incidence 
vectors of nonadjacent pairs of nodes). The facets are defined by the nonnegativity 
constraints, edge constraints, and single further inequality 


(11.49) %1+%9+%3+%44+25 < 2. 


Since Cs has no triangles, we have QSTAB = FSTAB. This polytope has a single 
vertex (4,..., 5) in addition to the incidence vectors of stable sets. 


Turning to TSTAB, we know by Theorem |11.28] that it has 11 vertices, the 
same vertices as STAB. The umbrella construction in Example gives a point 


(11.50) (Fava) € TSTAB(Cs) 

which is not in STAB(G) by (11.49). Applying the umbrella construction to the 

complement, and scaling, we get an orthogonality constraint 

(11.51) ayte+a5 < V5, 

showing that the special vertex of FSTAB(C;) does not belong to TSTAB(Cs5). @ 
This example shows that not every orthogonality constraint follows from the 


clique constraints. In fact, the number of essential orthogonality constraints is 
infinite unless the graph is perfect. 


Proposition 11.30. TSTAB(G) is polyhedral if and only if the graph is perfect. 
Proof. If G is perfect, then STAB(G) = QSTAB(G) by Theorem 11.24) and 
(11.42) implies that TSTAB(G) = STAB(G) = QSTAB(G), so TSTAB(G) is poly- 
hedral. To prove the converse, suppose that TSTAB(G) is polyhedral, then Theo- 
rem|11.28]implies that TSTAB(G) = STAB(G). We can apply this argument to G, 
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since the antiblocker TSTAB(G)*”! = TSTAB(G) is also polyhedral; we get. that 
TSTAB(G) = TSTAB(G)*"! = STAB(G)*"! = QSTAB(G). 


So STAB(G) = TSTAB(G) = QSTAB(G), which implies that G is perfect by 
Theorem [11.24 


11.5. Applications 


11.5.1. Shannon capacity. In the introduction, we have described how to 
use orthogonal representations to determine the Shannon zero-error capacity of the 
pentagon. What happens with other confusion graphs? 

Let V be an alphabet with confusion graph G = (V,E). To describe the 
confusion graph of longer messages, we use the strong product of two graphs. In 
these terms, a(G™") is the maximum number of non-confusable words of length k: 
words composed of elements of V, so that for every two words there is at least one 
i (1 <i <k) such that the i-th letters are different and nonadjacent in G, i.e., 
non-confusable. It is easy to see that 


(11.52) a(GRH) > a(G)a(A). 
This implies that 

(11.53) Ge) > a(G™ alG™), 
and hence 

(11.54) aC") >a(Gy* 


The Shannon capacity (zero-error capacity, if we want to be pedantic) of a 
graph G is the value 
(11.55) @(G) = lim a(G™*)1/*, 


co 


Inequality (11.53) implies, via Fekete’s Lemma, that the limit exists, and (11.54) 
implies that 


(11.56) @(G) > a(G). 


Rather little is known about this graph parameter for general graphs. For example, 
it is not known whether O(G) can be computed for all graphs by any algorithm 
(polynomial or not), although there are several special classes of graphs for which 
this is not hard. The behavior of O(G) and the convergence in are rather 
erratic; see [Alon 1998) |Alon—Lubetzky 2006). 

Let us describe a few facts we do know. First, let us generalize the argument 
from the Introduction bounding 0(C4). Let 7(G) denote the minimum number 
of complete subgraphs covering the nodes of G (this is the same as the chromatic 
number of the complementary graph.) Trivially 


(11.57) a(G) < x(G). 


Any covering of G by X(G) cliques and of H by Y(#) cliques gives a “product 
covering” of GX H by X(G)xX(#) cliques, and so 


(11.58) X(GWH) < xX(G)xX(#). 


Hence 
a(G*) <x(G™*) < x(a), 
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and thus 
(11.59) O(G) < x(G). 


It follows that if a(G) = x(G), then O(G) = a(G); for such graphs, nothing better 

can be done than reducing the alphabet to the largest mutually non-confusable 

subset. In particular, this answers the Shannon capacity problem for perfect graphs. 
Instead of ¥, we can use J? to bound the Shannon capacity: 


oG*) < W(G*) < WG), 
which implies 
Proposition 11.31. For every graph G, 
O(G) < WG). 


Since 0(C5) = V5, we get that equality holds in Example {1.2} @Q(C5) = V5. 
This argument can be generalized to an infinite class of graphs: 


Corollary 11.32. If G is a self-complementary graph with a node-transitive auto- 
morphism group, then O(G) = Vn. 


Proof. The diagonal in GWG is stable, so a(GHG) = a(GRG) > n, and 
hence @(G) > \/n. On the other hand, 0(G) < 0(G) = V/n by Corollary[11.7| 


Example 11.33 (Paley graphs II). Paley graphs form a class of graphs to which 
this corollary applies, and whose Shannon capacity can be determined exactly: 
@(Pal,) = 0(Pal,) = \/p. Assuming that the stability number of a Paley graph is 
polylogarithmic in p (as conjectured), for this infinite family of graphs the Shannon 
capacity is much higher than the stability number. 4 


The tensor product construction in the proof of Theorem shows that if 
G has an orthonormal representation in dimension c, and H has an orthonormal 
representation in dimension d, the GXH has an orthonormal representation in 
dimension cd. It follows that the minimum dimension of any orthonormal repre- 
sentation is an upper bound on @(G). Exercise[11.2|shows that this bound is never 
better than 0(G). 

However, if we consider orthogonal representations over fields of finite charac- 
teristic, then the analogue of J is not defined, but the minimum dimension in which 
an orthogonal representation exists may provide a better bound on the Shannon 
capacity than 0. More generally, consider a graph G and a Vx V matrix A such 
that Aj; ¢ 0 for every node i, and A;; = 0 for every pair i, 7 of nonadjacent nodes. 
Haemers proved that rk(A) is an upper bound on the Shannon ca- 
pacity of G, and constructed matrices (over an appropriate finite field) for which 
this bound is better that J (see ie are examples where such a 
matrix has rank 3 while 0 = Q(n‘/4) [Alon 2018]. Random graphs behave in the 
opposite way: by Example 0(G(n,1/2)) = O(n'/?) with high probability, 
while for every fixed field (finite or infinite), every matrix A with the properties 
above has rank 2(n/logn) [Alon at_al. 2018). 

We'll return to the Shannon capacity in a quantum communication setting in 


Chapter 


208 11. ORTHOGONAL REPRESENTATIONS: THE SMALLEST CONE 


11.5.2. Approximate coloring. Suppose that somebody gives us a graph 
and guarantees that the graph is 3-colorable, without telling us the 3-coloring it- 
self. Can we find this 3-coloring? (This “hidden 3-coloring problem” may sound 
artificial, but this kind of situation does arise in cryptography and other data se- 
curity applications; one can think of the hidden 3-coloring as a “watermark” that 
can be verified if we know where to look.) 

It is easy to argue that knowing that the graph is 3-colorable does not help: it 
is still NP-hard to find the 3-coloration. But suppose that we would be satisfied 
with finding a 4-coloration, or 5-coloration, or (log n)-coloration; is this easier? It is 
known that to find a 4-coloration is still NP-hard , but little 
is known above this. Improving earlier results, gave 
a polynomial time algorithm that, given a 3-colorable graph, computes a coloring 
with O(n'/4(Inn)°/?) colors. This was improved in several papers; the current. best 


exponent is O(n®19%6) [Kawarabayashi Thorup 2014). 

We sketch the algorithm of |Karger-Motwani-Sudan 1994). Theorem 
implies that G contains a stable set of size at least n3/4/(10V/Inn), and the proof 
can be converted to a randomized polynomial time algorithm to find such a set. 
Color this set with one color, delete its nodes, and repeat the procedure with the 
remaining graph Gj. 

We claim that this procedure results in a coloring of G with 40n!/+V/In n colors. 
Indeed, the number of nodes in G is 


3/4 


1 
10Vlnn ( ae) 


then by induction, the number of colors we use is at most 


ny <n—-10 


1 1/4 
1+40ni/4 Inn, < 1440n4/4(1-_—____ Inn 
10n!/4,/Inn 
1 
< 1440n'/4 (1 — —__—)vinn = 40n!/4VIn n. 
40n!/4V/Inn 


Exercise 11.1. If )(G) = 2, then G is bipartite. 


Exercise 11.2. Prove that the minimum dimension in which a graph G has an 
orthonormal representation is at least 0(G). 


Exercise 11.3. Let G be a graph and v € V. (a) ¥(G\v) > 0(G)—1. (b) If v is 
an isolated node, then 0(G\v) = 0(G)—1. (c) If v is adjacent to all other nodes, 
then }(G\v) = BG). 

Exercise 11.4. Let G be a graph and let V = S,U---US, be a partition of V. 
(a) 0(G) < 30, ¥(G[Si]). (b) If no edge connects nodes in different sets S;, then 
equality holds. (c) Suppose that any two nodes in different sets S; are adjacent. 
How can 0(G) be expressed in terms of the 0(G[S;])? 


Exercise 11.5. Prove that the graph parameter 0 (G) introduced in Remark 
could be defined in any of the following ways: (a) tightening the conditions 
by requiring that v/v; > 0 for all i and j; (b) relaxing the conditions 
) by requiring only w/w; < —1/(t—1) for ij € E; (c) relaxing the 
orthogonality conditions in by requiring only uju,; < 0 for ij € E. 
Exercise 11.6. Formulate and prove the analogue of Exercise[I1.5] for the graph 
parameter 0+ (GQ). 


Exercise 11.7. Prove that 0” (G)0*t (G) > n for every graph G. 
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Exercise 11.8. Prove the following weak converse to Proposition [11.4] 
Szegedy 1994]: Every graph G has a nonempty subset S' C V such that 


= 1 1 |S| 
w(G) < (145+---+=) , 
(G)s 2 n/ 3(G[S]) 
Exercise 11.9. Prove that allowing orthogonal representations in the complex 
Hilbert space (define!) does not change the value defined by formula (11.16). 


Exercise 11.10. Prove the formula (11.34) for even cycles with their longest 
diagonals added, by showing that the solution given above it is optimal. 
Exercise 11.11. Construct an optimal dual orthogonal representation for the 
Kneser graph K;. 
Exercise 11.12. Let vi,...,Vn € R” be any set of vectors, and let A = Gram(v). 
Prove that 

mS max{ ) (d's)? : deR?, |d= a}. 


t 


Exercise 11.13. With the notation of Lemma|11.19| every graph G satisfies 
HG)(HG)-1P <>) 9G)?. 


ieVv 
Exercise 11.14. Prove that the length of the sum of n unit vectors such that 
among any three of them some two are orthogonal is at most 4n?/3. Prove that 
this is best possible up to the constant. 
Exercise 11.15. (a) Show that any stable set S provides a feasible solution of 
the dual program in (11.6). (b) Show that any k-coloring of G provides a feasible 
solution of the primal program in (11-6). (c) Give a new proof of the Sandwich 
Theorem [11.1] based on (a) and (b). 
Exercise 11.16. Let G be a graph and let A be a V x V matrix such that Aj; 4 0 
for every node i but A;; = 0 for every pair i, 7 of nonadjacent nodes. (a) Prove 
that a(G) < rk(A); (b) O(G) < rk(A); (c) These estimates are valid for matrices 
A over any field; (d*) Construct a graph G and a matrix A as above (over some 
field) for which rk(A) < 0(G). 
Exercise 11.17. Prove that the vertices of FSTAB(G) are half-integral, and show 
by an example that they are not always integral. 
Exercise 11.18. Show that for a perfect graph, a maximum stable set and a 
coloring with minimum number of colors can be computed in polynomial time. 
Exercise 11.19. The fractional chromatic number x*(G) is defined as the least 
real number ¢ for which there exists a family (A; : j = 1,...,p) of stable sets 
in G, and nonnegative weights (7; : j = 1,...,p) such that )),7; = t¢ and 
2; Ta; = 1v. The fractional clique number w*(G) is the largest real number s 
for which there exist nonnegative weights (0; : 1 © V) such that 57,0; = s and 
ica a; <1 for every stable set A. 
(a) Prove that w(G) < w*(G) and x(G) > x*(G). 
(b) Prove that y*(G) = w*(G). 


(b) Prove that 0(G) < y*(G). 


209 


CHAPTER 12 


Orthogonal Representations: Quantum Physics 


As the basic setup in quantum physics, the state of a physical system can be 
described by a vector of unit norm in a (complex) Hilbert space. (In the simple 
systems we need, this space will be finite dimensional.) A measurement on a system 
in quantum state is performed by applying a self-adjoint linear operator to it. The 
simplest measurement operator is the orthogonal projection of the state vector onto 
a one-dimensional subspace; such an experiment can be thought of as checking a 
particular property of the state x. If u is the vector generating this subspace (which 
is unique up to a scalar of absolute value 1), then the probability that the property 
checks out is the squared inner product of u and x. 

The fact that quantum physics assigns vectors to discrete objects like “proper- 
ties” suggests analogies with orthogonal representations and other geometric repre- 
sentations treated in this book. As it turns out, this is more than just an analogy. 

The most successful area of applying quantum physics in computer science, at 
least so far, has been quantum information theory. Using phenomena of quantum 
physics (for example, entanglement of particles), one can create communication 
channels that are more efficient than classical communication channels. We will 
discuss a quantum version of the Shannon capacity problem from the Introduction, 
along with two further interesting applications of orthogonal representations, to the 
theory of hidden variables, and in the construction of strangely entangled states. 


12.1. Preliminaries: quantum states and entanglement 


As it happens, the applications we are going to discuss are all about entangle- 
ment, so let us introduce this notion first. 

We write u-v for the inner product of two vectors u,v € C4, to emphasize that 
it is used in the Hilbert space sense: u-v = )7), %jv;. However, allowing complex 
entries will not play any important rolef!] 

The simplest example of a quantum physical system is a qubit: a qubit is a unit 
vector in C?. We fix a basis {e9,e,} in C?. The vector eg corresponds (in a sense) 
to the logical value “NO” of the bit, while e;, to the value “TRUE”. Qubits are 
the basic units of quantum computation and quantum information theory, just like 
bits are for the classical versions. 

Consider two quantum physical systems A and B. Separately, their states can 
be described by unit vectors x € C4 and y € C°. The state of the union of the two 
systems can be described by a vector in the tensor product space C4@C°. If the 
two systems in states x and y are “independent” (unentangled), their joint state 


lWe digress from the standard physics notation, where a row vector is denoted by (z|, a 
column vector, by |y), and their inner product by (a|y). This would make reading this chapter 
easier for physicists, but not for mathematicians. Since most of this book is about mathematics, 
I do not change notation for this single chapter. 
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is xoy, which is then called a product state. However, this will not be the case in 
general, as a vector z € C?7@C*® cannot generally be written as the tensor product 
of two vectors in C¢. A state that cannot be written as such a tensor product is 
called entangled. Every state in C4@C° can be written as 


d e 
z=) >) Sis(eiof;), 


i=1 j=1 


where {e;,...,@q} is the standard basis in C%, and f,,...,f is the standard basis 
in C*. The coefficients S;; are complex numbers with >); ; |S;;|? = 1. Sometimes 
it is convenient to describe z by the dx e complex matrix S = (5;;). In the case of 
two isomorphic systems (so e = d), the state 


is maximally entangled: in a sense, farthest from a product state. The rank of 
the matrix S is 1 in the case of a product state, and d in the case of the above 
maximally entangled state (when the matrix is the identity). 

Measurement in a quantum system is a complicated issue, but we need only a 
very simple special case. We have mentioned that checking a property of the quan- 
tum state x corresponds to projecting it onto a one-dimensional subspace. Given 
mutually orthogonal vectors aj,...,a, € C%, we can think of them as represent- 
ing q different values of the same property, which are mutually exclusive (this is 
not hard to see). We can check these properties simultaneously (since projections 
to these one-dimensional subspaces commute), and tell which of them holds: the 
property described by a; will be found to hold with probability |a;-x|?. If ¢ = d, 
then exactly one of them will hold. 


12.2. Capacity of quantum channels 


Entanglement leads to rather paradoxical behavior of particles; this was pointed 
out by Einstein, Podolsky and Rosen in 1935. Consider a pair of maximally en- 
tangled particles with a 2-dimensional state space: say two electrons, whose spin 
can be either “up” or “down”. The maximally entangled state to consider here is 
wel oe, + eer oe. Such a pair of particles is often called an Einstein—Podolsky— 
Rosen pair or EPR pair. Both elements of such a pair represent a qubit: the same 
qubit for each. 

Suppose that Alice and Bob split an EPR pair between themselves (while it 
remains in the same entangled state), and they travel to different far away places. If 
Alice measures the state of her electron, she will find it in one of the states e; and e2 
with the same probability. Say it is in state e,, then the entangled state collapses to 
e,0e; immediately. These long range correlated events could be interpreted from 
the point of view of Alice as an action apparently faster than light, so it would 
contradict special relativity - but only apparently. If Bob measures the state of his 
electron, then it will be in state e;; but if they do the measurements in different 
order what they observe will come out the same. You can play around with more 
EPR pairs to convince yourself that no information can be transmitted between 
Alice and Bob using only such pairs. 
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While splitting an EPR pair cannot be used to transmit information faster than 
light, such a strange behavior can be utilized in information theory and computer 
science. One important feature of sharing such an EPR pair is that Alice and 
Bob obtain the same random bit, which is very secure, since by the principles of 
quantum physics, nobody can learn this bit without destroying the entanglement. 

Here we utilize EPR pairs in a different way. One can ask for analogues of the 
Shannon zero-error capacity in quantum information theory. Let us start with an 
example showing that using entanglement does improve the efficiency of communi- 
cation in certain cases. 


Example 12.1. While entanglement does not allow to send information directly 
(faster than the speed of light) between two people, it does allow to make bet- 
ter use of classical communication channels. The following example is due to 
Cubitt et_al. 2010). 

We need a graph G with the following properties: 


(a) its edges can be covered by r complete d-subgraphs H1,..., H,-; 
(b) some q of these, say H),..., Hq, partition the node set; 

(c) a(G) <q; 

(d) G has a dual orthonormal representation v in R?. 


Several constructions of such graphs are known, it is perhaps easiest to describe 
such a graph with d = 4 and g = 6 [Peres 1991]. The node set of this graph can be 
defined as the set of vectors in R+ with coordinates 0,1 or —1, where the number of 
nonzero coordinates is 1,2 or 4, and the first nonzero coordinate is 1. We connect 
two of these nodes by an edge if they are orthogonal. By definition, this graph has 
a dual orthonormal representation in R*. To verify that properties (a)—(c) above 
are satisfied as well is a tedious but routine exercise. 

Next we construct a (noisy) classical communication channel: Its input alpha- 
bet is V, its output alphabet is [r], and on input a € V it outputs one of the indices 
i for which a € V(H;). The output 7 is chosen randomly and uniformly from all 
such 7. Thus two inputs a and b are confusable if and only if there is a chance that 
they lead to the same output, i.e., if there is an 7 € [r] such that a,b € V(H;). This 
is equivalent to ab € E, so the confusability graph of the channel is G. The number 
of inputs that can be used without any danger of confusion is a(G) < q. 

Now we endow the channel with an extra feature: we construct two copies a 
quantum system with a d-dimensional state space C%, and prepare them by bringing 
them to the above maximally entangled state. Alice and Bob get one of the two 
copies each. 

We use the dual orthonormal representation to show that in the presence of 
such entanglement, one can safely transmit q different one-letter messages. Alice 
wants to send message x € [|g] to Bob. She measures the state of her side in 
the orthonormal basis {v, : s € V(H,)}. As discussed above, the result of the 
measurement will be a random element v, of this basis. After that, the entangled 
system will be in the state v,ovs. 

Alice sends s to Bob through the noisy channel. Bob receives an index i for 
which V(H;) 5 s. (Ifi < q, this would be the intended information x, but the noisy 
channel may have output any such index i.) Since H; is complete, Bob can measure 
the state of his side in the orthonormal basis {v;: t € V(H;)}, and determine s. 
Since there is a unique x € [gq] for which s € V(H,.), this determines «x. 
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It is interesting to notice that while in Section as well as in the rest 
of this section, orthogonal representations are used to prove upper bounds on the 
capacity of a channel, here we have used dual orthogonal representations to design 
better protocols. 4 


So we see that using an entangled state can improve the zero-error capacity of a 
noisy channel. How much is the improvement? and 
prove that the theta-function is still an upper bound. One can use an entanglement- 
assisted channel repeatedly to gain in its capacity just like in the classical case. Since 
the theta-function is multiplicative, it remains an upper bound on the zero-error 
capacity. 


Theorem 12.2. Suppose that Alice and Bob are connected by a classical noisy 
channel with confusability graph G. In addition, there is an entangled state u € 
C40C4, where Alice has access to the first factor and Bob has access to the second 
factor. Then the maximum number of one-letter messages that Alice can transmit 
without the possibility of confusion is bounded by 0(G). 


Proof. Let the noisy channel have input alphabet X and output alphabet 
Z. For each input 2, let Z, C Z be the set of outputs that occur with positive 
probability. (Since we are interested in zero-error, the actual probabilities do not 
matter.) Two elements x and y are confusable if and only if 2,9 Z, 4 0. 

The other part of the equipment is the shared entangled state u € C¢@C4%, 


which we consider as a dxd complex matrix S such that S-S = tr(S5') = 
are [SuaF =1. 

The use of this entanglement-assisted channel can be described like this. Alice 
wants to transmit a message 7 € [m]. She performs a measurement on her half of 
u; this collapses the joint state to a state u’. She transmits a message x € X that 
depends on i and the result of the measurement. Once Bob gets the message (more 
exactly, he gets some message z € Z,,), he performs a measurement (depending on z) 
on his half of u’, which has m possible outcomes; the outcome of this measurement 
is supposed to coincide with Alice’s intended message i. 

To set up the linear algebra for the analysis of this protocol, let Alice apply 
an operator A;®J to u. As the result of the measurement she gets an eigenvector 
of A;. Let A? be the orthogonal projection onto the subspace of C% generated by 
those eigenvectors that lead to message x. The fact that always exactly one message 
x € X must be sent means, by the laws of quantum physics, that A? A? = 0 for 
x A#yand >>, A? =I for each « € X. This measurement collapses u to the state 
u’ described by the matrix A?S. 

Bob’s measurements consist of m orthogonal projections B7,...,B%, onto ap- 
propriate subspaces of C%. Similarly as before, we have B7B; = 0 fori # 7 and 
>); B7 =. Measurement B7 collapses the state u’ to A? SB. The fact that Bob 
is able to recover i means that A7 SB? = 0 if z € Z, andi Fj. 

We claim that 


(12.1) (A7'S)- (APS) =0 
in the following cases: (a) i= j, «Ay; (KF) i AJ, c=y; (C)iF#J, cy € E. Case 


(a) is easy, since we can write (A?S)- (A/S) = tr(S' AP AYS), and, as we have seen, 
At Aj = 0. In cases (b) and (c) there is an element z € Z,Z,, and using this we 
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get 
(APS) -(AYS) = tr(AYSS' AP) = tr(A¥S(S > Bz)S" AP) 
k 
= > tr(A¥SBz5" A?) = 0, 
k 


since for every k, either AfS Bz = 0 or BzS'A’ = 0. This proves (12.1). 
Define an X x X matrix M by 


m 


Mzy = >, (A7S)-(A¥S). 


i,j=l 
If cy € E, then M;,, = 0, since every term is zero by conditions (a) and (c) above. 
It is easy to see that M is positive semidefinite. Furthermore, using (b), 


tr(M) =~ YS (A?S)-(A?S) = 5S) 8-5 =m, 
Z i,g=1 x 
and 
tr(JM) = > tr(S"APAYS) =) tr(5"(D> AP) (> 44) 5) 
L,Yst,J i,j x y 
= m?tr(S'S) ne 
This shows that the matrix 1M almost fulfils definition of J, except that 


—T 
it is not necessarily real. But Z = x (M +M_) is real, and has the same trace and 
sum of entries, so 0(G) > m. 


One can consider quantum physical communication channels more general than 
entanglement-assisted classical channels. The theta-function can be generalized to 
such channels, so that it remains an upper bound on the zero-error capacity of the 


channel |Duan—Severini-Winter 2013]; see also [Cubitt et_al. 2011). 


12.3. Hidden variables 


Let us return to the Einstein—Podolsky—Rosen objection to quantum physics. 
One way out of the paradox of sustained correlations between the results of inde- 
pendently performed, spatially separated quantum measurements is the theory of 
hidden variables. Perhaps the states of the electrons of Alice and Bob are already 
determined when the pair is formed? Then there is nothing surprising in the fact 
that the two particles are found in the same state. In the well known parable of 
Schroédinger’s cat, this argument would mean that the cat in the closed chamber at 
a given moment in time is either dead or alive—we just do not know which. 

This interpretation, which arose from the objection to the nondeterministic— 
random interpretation of quantum events, suggests that if we knew the exact state 
of each particle (its “hidden parameters”), then we could predict quantum events 
with certainty. A consequence would be that even if we are not able to observe 
certain events simultaneously, they do actually occur. 

Based on the work in |Cabello—Severini-Winter 2014], we describe a connec- 
tion between the theta-function and this fundamental issue in quantum physics. 
Consider a quantum system, and let e€),...,¢€,, be observable events. Construct a 
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graph G on V = [n] in which ij € E if and only if e; and e,; are exclusive (cannot 
occur simultaneously). We call G the exclusivity graph of the system e1,...,€n, of 
events. 

How many of these events occur? 

We can observe any particular event e;, but this observation changes the state 
of the system, so we cannot observe all of the events. If we repeat the experiment, 
even if we can create the same starting state and observe the same event, we may 
get a different answer. But if we choose the event e; uniformly at random from the 
set {e1,...,€n}, and we repeat the experiment with the same starting state many 
times, then we can find, experimentally, the probability p that the observed event 
does occur. The expected number of events that occur is then pn. 

What can we say about p? In the classical setting, when e1,...,e, are observ- 
able events in a probability space (no quantum effects), we would have a probability 
distribution on the stable subsets of nodes of G. The number pn would be the ex- 
pected size of this set, i.e., the expected number of events that occur simultaneously. 
It is trivial that this number is at most a(G), so we get the inequality 


a(@) 


n 


(12.2) p< 


The same inequality can be derived in quantum physics, if we assume that it makes 
sense to talk about the number of events e; that actually hold in the given experi- 
ment (even though we can only check one of them). In the usual “hidden variable” 
interpretation of quantum physics this is the case. 

John Bell was the first to suggest simple measurement schemes and derive for 
them inequalities, which are related to (12.2), and could be experimentally verified 
(or rather falsified) [Bell 1964]. Since their derivation depends on the theory of 
hidden variables, disproving such an inequality disproves the hidden variable theory 
(at least in its simplest form). 

From basic quantum physical principles (not using hidden variables) one can 
only derive the weaker inequality 
(12.3) ps ue) 

n 
Indeed, state x is a unit vector in a complex Hilbert space H. Observing the event 
e; Means projecting x to a one-dimensional subspace, spanned by a unit vector 
u; € H; the probability that the event occurs is just the squared length of the 
projection, p; = |x-u;|?. Thus 


1 1 
p= 7 (Pit: + Pn) = aD, |x-uj|?. 
v7 


Two events that exclude each other must correspond to projections onto orthogonal 

vectors, and hence u is a dual orthogonal representation of G in a Hilbert space. 

This is a complex Hilbert space, but it is not hard to see that the value of the 

maximum in definition (11.8) of the theta-function does not change if we take 
complex Hilbert space instead of the real (Exercise (11.9)). Hence we get 

1 2- VG) 

=— x-ujl” < , 

e n » | i ~ 1 
From even weaker (simpler) principles, one gets an even weaker inequality. The 
exclusivity principle says that the sum of probabilities of mutually exclusive events 
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is at most 1. This implies the inequality 
bye <1 
i€B 
for every clique B in G; in the language of Section |11.4.2| the vector (p1,..., pn) 


must belong to the clique-constrained fractional stable set polytope QSTAB(G), 
and so it follows that 


" 
(12.4) pia 
n n 

Two special cases have been studied extensively. In the Clauser-Horne— 
Shimony—Holt version of the Bell experiment, one creates two entangled particles, 
say electrons, whose spins can be “up” or “down”; so the state of each particle is 
described by a 2-dimensional unit vector. These are sent to two far away observers, 
called (as usual) Alice and Bob. Alice has two possible settings a and a’ of her 
equipment measuring the spin, which means that she can choose one of these, and 
perform a measurement which returns “up” or “down”. Bob has, similarly, two 
settings b and Db’ (not the same settings as Alice). Let a; denote the event that 
Alice measures “up” in setting a. We have 8 analogous events. 

To reformulate this in our setting, consider the following 8 events: 
a+4b,,a_b_,a'b,,a_b_,a,b!,,a_b,a',b_, ab, (note the twist in the last two 
terms!). The exclusivity graph of these events is the Wagner graph Wg familiar 
from Example (Figure b)). Looking at the picture, it is easy to see 
that a(G) = 3. Let p denote the probability that selecting one of them uniformly 
at random and performing the corresponding measurements (one by Alice, one by 
Bob), the event does occur. Repeating the experiment many times, the value of p 
can be determined experimentally. 


a,b, ab’ 


FIGURE 12.1. The exclusivity graph Vg of the Clauser—Horne— 
Shimony—Holt experiment. The edge between (say) a/_b!, and 
a_b'_ indicates that they cannot occur simultaneously: for these 
two events, test b’ returns different results. 


Inequality (12.2) says in this case that 


a(G) 3 
< fe 
. n 


(12.5) = 5 = 0.375. 


Recall that this inequality follows from the theory of hidden variables. Since there 
are trivial linear equations relating the probabilities of the events a,b, etc., this 
inequality can be written in several equivalent forms, of which is the most 
convenient for us. 
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We have 0(G) = 2+ V2 (see Example}11.15), so the inequality 


(126) pe 


follows by (12.3) without the hypothesis of hidden variables, just by the laws of 
quantum physics. 


= 0.427... 


Remark 12.3. Traditionally, the inequality in the Clauser-Horne-Shimony—Holt 
experiment is expressed in terms of the quantum correlations, defined by 


E(ax;by) = E(H1y ey), 
and similarly for the other events. The CHSH inequality is then 
(12.7) E(a,,b4)+E(a,,b_)+E(a_,b,)—E(a_,b_) < 2. 


This inequality can be expressed in terms of probabilities as 
(12.8) P(a_b_) < P(a,b_)+P(a_b,)+P(a/,b4), 
which in turn is equivalent to (12.5) (see Exercise }12.1). 


Returning to the bounds and (12.6), which of these bounds is the 
“truth”? After a long line of increasingly sophisticated experiments, recent re- 
ports [Hensen et_al. 2015], [Giustina et al. 2015], claim to have 
eliminated all the implicit assumptions (“loopholes”), and show that the bound 
does not hold in general. The experiment in provides 
the value p + 0.401. This value is about half way between the bounds in and 
(12.6), and it can be considered as a disproof of the “hidden variable” interpretation 
of quantum physics (at least in its basic form). Whether other experimental setup 
will get even closer to the bound in c is an open question. 

The experiment described in works in a similar spirit, and 
leads to a more complicated measurement setup but to a simpler graph, namely 
Cs; see also [(Cabello—Severini-Winter 2014]. In this latter paper it was also shown 
that is best possible in a sense: for every simple graph G one can con- 
struct mathematical models of quantum physical systems in which equality holds. 
describes potential applications of these ideas to quantum com- 
puting. 


12.4. Unextendible product bases 


This application of orthogonal representations leads to the construction of 
highly entangled states. 

Our introductory discussion of two entangled states easily extends to a quantum 
physical system consisting of m parties. If the state of the i-th party is described 
by a unit vector in a complex Hilbert space C%, then a state of the whole system 
is described by a vector in the tensor product H = C"%@---@C%. A product 
state is a vector of the form v,0-+:oVvm, where vj € C%, |v;| = 1. Physically, this 
corresponds to the states of the individual parties to be “independent” , more exactly 
disentangled. We exclude trivial cases only if we assume that m,dj,...,dm > 2. 

For every product state v, let’s write v = v'o---ov™, where v’ € C% and 
|v;| = 1. This decomposition is unique up to scalars of absolute value 1, which we 
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can choose arbitrarily. The inner product of two product states can be computed 
as follows: 


(12.9) u-v = (u'-v')---(u™-v™). 


An incomplete orthogonal product basis, or shortly product basis is a system of 
mutually orthogonal product states in H. A product basis B is uneztendible, if it is 
not the subset of a larger product basis. In other words, the orthogonal complement 
B+ of the linear span of the basis vectors contains no product state. 

Unextendible product bases were introduced in in order 
to construct highly entangled states. In fact, they proved that such a state leads to 
high entanglement in two different senses. We state these consequences informally, 
without giving the exact definition and explaining their significance, which would 
lead us too far into physics. 


e From the mixed state obtained as the uniform distribution over all states 
(unit vectors) in B+ no product space can be “distilled” by local measurements 
and classical communication. 


e The states in an unextendible product basis are locally immeasurable, i.e., 
none of them can be distinguished from the others by local measurements and 
classical communication. 


There are trivial unextendible product bases: taking any orthonormal basis 

i,--.,bi, in each C%, the set of all vectors of the form bj,o---ob’” forms a 

full orthonormal basis in H, of cardinality d,---d,. Our goal is to construct an 
unextendible product basis as small as possible. 


Lemma 12.4. If {v1,...,Vvn} ts any uneztendible product basis in H, then n > 
1430", (di-1). If equality holds, then the vectors {vi : s = 1,...,n} are in 
general position in C% for every 1 <i<m. 


Proof. If n < 57",(d;—1), then we can partition the basis into m sets 
By,...Bm, where |B;| < d;—1. Then there is a unit vector u; that is orthogo- 
nal to the |B;| vectors v‘ (s € B;), and hence the vector u,0---ou,, is orthogonal 
to every v € B. This contradicts the assumption that B is unextendible. 

In the case of equality, the same argument can be applied provided one of the 
sets B; has d; elements but the vectors {v!: s € B;} do not span C%. 


The following result of |Alon—Lovasz 2001} translates the question whether 
equality can be attained in Lemma into a purely graph-theoretic question. 


Lemma 12.5. Let n = 14 57\",(d;—1). Then there exists an unextendible product 
basis with n elements in H if and only if there exists a decomposition Ky, = G,U 
---UG», into edge-disjoint graphs such that G; does not contain a complete bipartite 
subgraph with d; +1 nodes (t= 1,...,m). 


As we have used before, the condition on G; is equivalent to saying that its com- 
plement G; is (n—d;)-connected. 


Proof. I. Suppose that there is an unextendible product basis {vj,...,V,} in 
H. For i = 1,...,m, we define a graph G; on V = [n] as follows: we connect two 
nodes s,t € V by an edge if vi-v} = 0. The condition that v,-v, = 0 implies by 
(12.9) that every pair (s,t) of nodes will be connected in at least one graph Gj. (If 
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the pair gets connected in more than one graph G;, we can delete this edge from 
all but one of the graphs.) 

We have to show that G; does not contain a complete bipartite subgraph on d; + 
1 nodes. Suppose it does, and let U, W C V be the color classes, where |U|, |W| > 1 
and |U|+|W| > d;+1. For every s € U and t € W, we have vi-v} = 0. So the 
subspaces A = lin{ut: s €U and B=lin{u’: s € W (over C) are orthogonal to 
each other, which implies that dim(A)+dim(B) < d;. It follows that one of U and 
W, say W, consists of linearly dependent vectors, contradicting Lemma/12.4] 

II. Suppose that K,, has a decomposition as described. Theorem [10.9] implies 
that G; has an orthogonal representation (uj: ¢ € V) in R® in general position. 
The vectors v; = ujo---ou)” (t=1,...,n) form a product basis: if 1<s<t<n, 
then 

Vs: Ve = (uy-uj) +++ (uy'-uj") = 0, 
since st € E(G;) for some 7, and then u’- uj = 0. 

We show that the product basis {v,,...,V,} is unextendible. Indeed, suppose 
that it could be extended by a product state w. Then w is orthogonal to every 
vector v;, and hence 


(wi -ujy)-:-(w™”-ul”) =0 (t=1,...,n). 
Hence for every 1 < t <n there is an index 1 < i < m for which w’-uj = 0. But 
since the vectors uj,..., uj, are in general position, one and the same index 7 could 


serve at most d;—1 choices of t. Hence n < }°,(d;—1), a contradiction. 


Unfortunately, the condition given in Lemma is not always sufficient for 
the existence of a decomposition of K,, as in the lemma. Suppose that equality 
holds: 


(12.10) n=1+5 (d;-1), 


and there is a decomposition K, = G,U---UG,, as in the lemma. Since G; contains 
no i; a,, the maximum degree in G; is at most d;—1. So implies that G; is 
regular of degree d; —1. This is impossible if n is odd and d; is even. Another, more 
trivial exception is when m = 2 and d, = 2 < dg (or the other way around): Then 
G, must be a perfect matching, and so Gp contains a complete bipartite graph 
Kon—2 = Ko,a,-1- 

It was shown in that these are the only exceptional cases: 


Theorem 12.6. Let m,di,...,dm > 2, andn =1+ 50" ,(d;—-1). Assume that if 
n is odd then every d; is odd, and if m = 2 then d,dz2 > 3. Then there exists an 
unextendible product basis with n elements in C& @---@C%™. 


The proof uses Lemma}12.5| by constructing an appropriate decomposition of 
K,,. For the details, we refer to |Alon—Lovasz 2001}. Other constructions, settling 
several of the exceptional cases, were given by |Chen—Johnston 2014}. 


Exercise 12.1. Prove that (12.5) is equivalent to the simpler inequality 
P(a,b4) < P(a,b_)+P(a_b,)+P(a‘b4). 


CHAPTER 13 


Semidefinite Optimization 


In our treatment of orthogonal representations, a major tool was the duality 
theory for the theta-function (Section{11.2). We have remarked there that the main 
step could be considered as a special case of semidefinite duality. Since semidefinite 
optimization has other important applications in discrete mathematics, foremost in 
the area of approximation algorithms, but also in constructing geometric represen- 
tations, it is now time to survey some of the key results and illustrate the general 
technique that leads to these applications. 


13.1. Linear and semidefinite programs 


Linear programming has been perhaps the most fundamental and successful 
tools in optimization and discrete mathematics. We assume some basic familiarity 
with this theory; see e.g. [Vanderbei 2001], [Schrijver 2003], [Korte—Vygen 2008}. 

Linear programs are special cases of convex programs; semidefinite programs 
are more general than linear programs but still convex programs, to which 
many of the useful properties of linear programs extend. For more compre- 
hensive presentations of semidefinite optimization, see [Vandeberghe-Boyd 1996}, 
WolkowiezSaigal-Vandenberghe 2000). 


13.1.1. Different forms of semidefinite programs. A semidefinite pro- 
gram is an optimization problem of the following form: 


(13.1) minimize clx 
subject to 2,A,+---+2%,4,—-—B> 0. 


Here Aj,...,An,B are given symmetric mxm matrices, and c € R” is a given 
vector. Recall that MM > 0 means that M is positive semidefinite. We can think of 
oe x;A;—B as a matrix whose entries are linear functions of the variables. 

As usual, any choice of the values x; that satisfies the given constraint is called a 
feasible solution. A solution is strictly feasible, if the matrix )>, 2;A;—B is positive 
definite. We denote by Uprimai the infimum of the objective function. 

The special case when Aj,...,A,,B are diagonal matrices is just an arbitrary 
linear program, and it is very fruitful to think of semidefinite programs as general- 
izations of linear programs. But there are important technical differences. Unlike 
in the case of linear programs, the infimum may be finite but not a minimum, i-e., 
not attained by any feasible solution. 
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Example 13.1. Consider the following simple semidefinite program in two real 
variables x and y: 


(13.2) minimize 2 


subject to & > = 0. 
ley 


The constraint means that « > 0, y > 0 and xy > 1, which implies that x > 0, but 
x can be arbitrarily close to 0. 4 


As in the theory of linear programs, there are a large number of equivalent 
formulations of a semidefinite program. Of course, we could consider minimization 
instead of maximization. We could allow additional linear constraints on the vari- 
ables x; (inequalities and/or equations). These could be incorporated into the form 
above by extending the A; and B with new diagonal entries. 

We could introduce the entries of the matrix X = }7, x;A;—B as variables, in 
which case the fact that they are linear functions of the original variables translates 
into linear equations between them. Straightforward linear algebra can be used to 
transform into an optimization problem of the form 


(13.3) minimize CO-X 
subject to X =0 
Dik CS tanh) 


where C, D,,..., Dz are symmetric m xm matrices and d;,...,d, € R. Note that 
C'-X is the general form of a linear combination of entries of X, and so D;-X = d; 
is the general form of a linear equation in the entries of X. It is easy to see 
that we would not get any substantially more general problem if we allowed linear 
inequalities in the entries of X in addition to the equations. 

With these variants in mind, it should be clear that equations and 
express the theta-function as optima of semidefinite programs. The relationship 
between these programs will be discussed next. 


13.1.2. Duality. Duality is one of the most important aspects of linear pro- 
grams, and we continue our treatment with generalizing duality to semidefinite 
programs. We start with generalizations of the Farkas Lemma, which in its original 
form characterizes the solvability of systems of linear inequalities. Let S denote 
the linear space of real symmetric m xm matrices, and P C S, the set of positive 
semidefinite matrices. The set P is a convex pointed cone. Let P’ be the set of 
positive definite matrices, which is just the interior of P. 


Lemma 13.2 (Farkas Lemma, homogeneous version). Let LC S be a linear 
space of symmetric matrices. Then exactly one of the following alternatives holds: 
(i) There is a matrix X € L, X > 0. 
(ii) There is a matrix Y € L+,Y #40, Y =0. 


We could state this fact as follows: either both £ and £+ contain nonzero 
positive semidefinite matrices, or one of them contains a positive definite matrix. 


Proof. Suppose that both alternatives hold. Then the conditions X € £ and 
Y € L+ imply that X-Y = 0, while the conditions X > 0, Y > 0 and Y # 0 imply 
that XY > 0. (To see this, use an orthonormal basis in which Y is diagonal.) 
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Conversely, suppose that £NP’ = §. Then there is a hyperplane H C S 
containing £ but disjoint from P’. The equation of this hyperplane is Y-X = 0 
with some symmetric matrix Y 4 0. Clearly Y € £+. Assuming that P’ is on the 
positive side of H, we have Y-X > 0 for every positive definite matrix X, which 
implies that Y-X > 0 for every positive semidefinite matrix X. This implies that 
Y>0. 


Depending on how the subspace £ is given, the Lemma can be reformulated 
in several ways. For example, let A1,..., A, be symmetric m xm matrices. Then 
exactly one of the following alternatives holds: 

(i) There are 21,...,2, € R such that 7;Ai+---+a,Ap > 0. 

(ii) There exists a symmetric matrix Y 4 0, Y > 0 such that Ay-Y =--- = 
An: Y =0. 

The Semidefinite Farkas Lemma has several inhomogeneous versions; here we 
state one (see the exercises for others). 

We need some notation. Let A be an affine subspace of a Euclidean space S 
that does not contain 0, and let A* = {Y €S: Y-X =-1VX € A}. It is easy 
to see that (A*)* = A. Let A denote the linear subspace obtained by translating 
A to the origin, and let A* be obtained similarly from A*. It is easy to see that A 
and A* are orthogonal linear subspaces with dim(A) +dim(A*) = dim(S)—1, and 
A* = lin(A)+. 

Suppose that A is defined by the set of equations A;-X = c;, i = 1,...,n. 
Then elementary linear algebra gives the following expressions for the associated 
spaces: 


(13.4) A ={XeS: Ay X=---=A,-X =o}, 


Now let S be the set of symmetric m x m matrices. It is trivial that A and A* 
cannot both contain positive semidefinite matrices. It would be nice if one of them 
always did, but this is not true, as the following example shows. 


Example 13.3. Let 


then 


ee <i 
A= {( 1 *): ser}, 
-1 0 
and these subspaces contain no positive semidefinite matrices (Figure|13.1). Both 
A and A* are disjoint from P, but they get arbitrarily close. 


But we can state the following. 


Lemma 13.4. Let A be an affine space of symmetric m xm matrices that does not 
contain the 0 matrix. Then either A or A* contains a nonzero positive semidefinite 
matriz, or both A and A* contain nonzero positive semidefinite matrices. 
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X99 


FIGURE 13.1. The space of 2 x 2 symmetric matrices, the positive 
semidefinite cone P, and an affine subspace A getting arbitrarily 
close to P. The subspace A* has similar properties. 


If the last alternative occurs, then Lemma implies that lin(A) and lin(A*) 
contain no positive definite matrices. 


Proof. Clearly A is a linear hyperplane in lin(A). If there is a positive semi- 
definite matrix Z € lin(A)\A that is on the same side of A as A, then Z has 
a positive multiple in A and we are done. So we may assume that PMlin(A) is 
contained in the closed halfspace of lin(A) bounded by A not containing A. 

This implies that A contains no positive definite matrix, and hence there is a 
matrix Y 4 0 such that the hyperplane Yt = {X € S: X-Y =0} contains A but 
no positive definite matrix. We may assume that X-Y > 0 for all X € P’, then 
X-Y > 0 for all X € P, and hence Y > 0. 

If Y ¢ lin(A)+ = A*, then Y+nlin(A) = A, and so Y+N.A = 0, which implies 
that Y-A =a <0 for all Ac A. Then —(1/a)Y is positive semidefinite matrix in 
A*, and we are done. If Y € A‘, then A* contains a positive semidefinite matrix. 
The conclusion follows for A similarly. 


Now we turn to duality. Recall the meaning of the dual variables in linear 
programming: they are multipliers for the constraints so that a linear combination 
of the constraints with these coefficients gives the sharpest possible bound on the 
objective function. With this view in mind, it is clear that equalities can have 
arbitrary real multipliers, and inequalities must have nonnegative multipliers (or 
nonpositive ones, if we want to reverse the direction of the inequality). If we want 
to construct a dual program in semidefinite programming, the only (but nontrivial!) 
addition is that the multiplier for a semidefiniteness constraint X > 0 is a positive 
semidefinite matrix Y > 0. This is based on the fact that X > 0 is equivalent to 
X-Y > 0 for every Y > 0. 
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Consider a semidefinite program, which we recall for convenience: 


(13.5) minimize S- iL; 


i=l 


nm 
subject to So iA; = B. 


We can follow the “rules” above for defining multipliers for the constraints, and 
formulate the dual program: 


(13.6) maximize B-Y 
subject to A;-Y=q; (¢=1,...,n) 
Y>=0. 


Note that this too is a semidefinite program in the general sense. We denote by 
Vaual the supremum of the objective function. 

With this notion of dual programs, the Duality Theorem holds un- 
der additional conditions (which cannot be omitted!); see [Wolkowitz 1981], 
Vandeberghe Boyd 1996), [Ramana 1997]. 


ran (i 13.5. Assume that both the primal program (13.5) and the dual pro- 
gram ) have feasible solutions. Then Vprimal > Udual- Lf, in addition, the dual 
program Ee a strictly feasible solution, then the primal optimum is attained and 


Uprimal = Udual- 


It is easy to see that the roles of primal and dual could be interchanged: if the 
dual program has a strictly feasible solution, then the primal optimum is attained, 
and the two optima are equal. In particular, if both programs have strictly feasible 
solutions, then the supremum, respectively infimum, of the objective functions are 
attained and are equal. 


Proof. Let x be a feasible solution . the primal program and Y, a 
feasible solution of the dual program (13.6). Then 
(13.7) 


Downe = A Yaa (Yin i)'¥ =(SomAi-B)-Y+B-Y > B-Y. 


This proves that Vprimal > Vdual- 

Suppose that the primal program has a strongly feasible solution, but 
either Vprimal # Udual Or the dual optimum is not attained. In either case, the 
dual has no solution with objective value t = Vprimal. This means that the affine 
subspace A of S defined by 


(13.8) BY=t, <A;-Y=q, (i=1,...,n) 


contains no positive semidefinite matrix. 
Next, assume that the subspace A* contains a positive semidefinite matrix X. 
By the representation (13.4), this has the form 


(13.9) X= aoB+ ) = x Ai, where aot tS) aici =-l. 
i=l 


i=1 
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For any dual solution Y, we have 


n n 
X-Y=aB-Y+) aA Y =aB-Y+ 5 xe; = xo(B-Y-t)-1, 
i=1 i=1 
and since B-Y < t, we must have rp < 0. Hence —(1/zo)X is a solution of the 
primal program (13.6) with objective value t+1/xo < t, contradicting the definition 
of t. 

Thus neither A nor A* contains a positive semidefinite matrix. By Lemma 
13.4} this implies that neither lin(A) nor lin(A*) can contain a positive definite 
matrix. But for a strongly feasible primal solution, the matrix )°,27;A;—B is a 
positive definite matrix in lin(A*), a contradiction. 


The following complementary slackness condition also follows from (13.7). 


Proposition 13.6. Let x be a feasible solution of the primal program (13.5) and 
Y, a feasible solution of the dual program (13.6). Then both x and Y are optimal 
solutions if and only if (>, t:Ai—B)-Y =0. 


Using that for two positive semidefinite matrices X and Y, we have X-Y = 0 
if and only if XY = 0 (matrix product!), we see that the complementary slackness 
condition is equivalent to ()°,2;Ai—B)Y =0. 


13.1.3. Algorithms for semidefinite programs. There are two essentially 
different algorithms known that solve semidefinite programs in polynomial time: 
the ellipsoid method and interior point/barrier methods. Both of these have many 
variants, and the exact technical descriptions are quite complicated; we refer to 
for discussions of these. 

The first polynomial time algorithm to solve semidefinite optimization prob- 
lems in polynomial time was the ellipsoid method. This is based on the general 
fact that if we can test membership in a convex body K C R¢? (i.e., we have a 
subroutine that, for a given point x € R%, tells us whether or not x € K), then 
we can use the ellipsoid method to optimize any linear objective function over K 
Grotschel-Lovasz—Schrijver 1988]. The precise statement of this fact needs non- 
trivial side-conditions. 

For any semidefinite program (13-1), the set K of feasible solutions is convex. 
With rounding and other tricks, we can make it a bounded, full-dimensional set in 
R”. To test membership, we have to decide whether a given point x belongs to K; 
ignoring numerical problems, we can use Gaussian elimination to check whether the 
matrix Y = be x; A;—B is positive semidefinite. Thus using the ellipsoid method 
we can compute, in polynomial time, a feasible solution x that is approximately 
optimal. 

Unfortunately, the above argument gives an algorithm which is polynomial, 
but hopelessly slow, and practically useless. Semidefinite programs can be solved in 
polynomial time and also practically reasonably efficiently by interior point methods 
[Alizadeh 1995]. The algorithm can be described 
very informally as follows. We consider the form (13.3), denote by K the set of its 
feasible solutions (these are symmetric matrices), and want to minimize a linear 
function C'.X over X € K. The set K is convex, but the minimum will be attained 
on the boundary of K, and this boundary is neither smooth nor polyhedral in 
general. Therefore, neither gradient-type methods nor simplex-type methods of 
linear programming can be used. 
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The main idea of barrier methods is that instead of minimizing a linear objective 
function C'X, we minimize the convex function F\(x) = — log det(X)+AC™X for 
some parameter A > 0. Since F, tends to infinity on the boundary of K, the 
minimum will be attained in the interior. Since F\ is convex and analytic in the 
interior, the minimum can be very efficiently computed by a variety of numerical 
methods (conjugate gradient etc.) 

Of course, the point we obtain this way is not what we want, but if A 
is large it will be close. If we do not like it, we can increase A and use 
the minimizer for the old Fy as the starting point for a new gradient type 
algorithm. One can show that (under some technical assumptions about the 
feasible domain) this algorithm gives a good approximation of the optimum 


in polynomial time (see [Alizadeh 1995] ,/Vandeberghe—Boyd 1996] and the book 
Nesterov-Nemirovsky 1994]) . 


13.2. Geometric representations from semidefinite optimization 


13.2.1. Unit distance representation. Given a graph G, how can we rep- 
resent it by unit distances? To be precise, a unit distance representation of a graph 
G is a geometric representation u: V — R¢ for some d> 1 such that |u;—u,| = 1 
for every ij € E (we allow that |u;—u,| = 1 for some nonadjacent nodes i, 7). 

Every finite graph has a unit distance representation in a sufficiently high di- 
mension. (For example, we can map its nodes onto the vertices of a regular sim- 
plex with edges of length 1.) The first problem that comes to mind, raised in 
Erdés—Harary—Tutte 1965), is to find the minimum dimension d in which a graph 
has a unit distance representation. Figure[13.2]shows a 2-dimensional unit distance 
representation of the Petersen graph. Similarly to orthogonal representations of the 
complement (and by a very similar argument) the minimum dimension d of a unit 
distance representation is linked to the chromatic number of the graph: 


(13.10) 5 los x(@) ads 0G) 1, 


FIGURE 13.2. A unit distance representation of the Petersen graph 
in the plane. 


There are many other senses in which we may want to find the most economical 
unit distance representation. We only discuss one: what is the smallest radius of a 
ball containing a unit distance representation of G (in any dimension)? 

Considering the Gram matrix A = Gram(u) of a vector labeling u, it is easy 
to obtain the following: A graph G has a unit distance representation in a ball of 
radius R (in some appropriately high dimension) if and only if there exists a positive 
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semidefinite matrix A such that Aj; < R? for all (i € V) and Aj;—-2Aij+Aj;; = 1 
for all («7 € E). In other words, the smallest radius R is the square root of the 
optimum value of the following semidefinite program (and so it is polynomial time 
computable with arbitrary precision): 


(13.11) minimize w 
subject to A> 0 
Ay < w (iEV) 
Ay —2Aij + Ajj =1 (ij € B). 


Example 13.7. The unit distance embedding of the Petersen graph in Figure[13.2| 
is optimal if we want to minimize the dimension, but not if we want to minimize 
the circumradius. Let us illustrate how semidefinite optimization can find the 
optimal embedding by determining this for the Petersen graph. In the formulation 
above, we have to find a 10 x 10 positive semidefinite matrix A satisfying the given 
linear constraints. For a given w, the set of feasible solutions is convex, and it 
is invariant under the automorphisms of the Petersen graph. Hence there is an 
optimum solution which is invariant under these automorphisms (in the sense that 
if we permute the rows and columns by the same automorphism of the Petersen 
graph, we get back the same matrix). 

Now we know that the Petersen graph has a very rich automorphism group: not 
only can we transform every node into every other node, but also every edge into 
every other edge, and every nonadjacent pair of nodes into every other nonadjacent 
pair of nodes. A matrix invariant under these automorphisms has only 3 different 
entries: one number in the diagonal, another number in positions corresponding 
to edges, and a third number in positions corresponding to nonadjacent pairs of 
nodes. This means that this optimal matrix A can be written as 


A=aP+yJ+zI, 


where P is the adjacency matrix of the Petersen graph. So we only have these 3 
unknowns x, y and z to determine. 

The linear conditions above are easily translated into the variables x, y, z. But 
what to do with the condition that A is positive semidefinite? Luckily, the eigen- 
vectors of P are also eigenvectors of J and J, and hence the eigenvalues of A are 
the corresponding linear combinations of the eigenvalues of P, J and I. The eigen- 
values of P are well known (and easy to compute): they are 3, 1 (5 times) and -2 
(4 times). So the nonnegativity of the eigenvalues of A, together with the linear 
conditions gives the linear inequalities in give us the linear program 


minimize y+z 


38@2+10y+z2 =O), 


>0 
subject. to oe Soe 
—2¢+z > 0, 
22-22 =I. 
It is easy to solve this: « = —1/4, y = 1/20 and z = 1/4. Thus the smallest 


radius of a ball in which the Petersen graph has a unit distance representation is 
Jy +z = 3/10. The corresponding matrix A has rank 4, so this representation 
is in R*. 
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It would be difficult to draw a picture of this representation, but we can offer 
the following nice matrix, whose columns realize this representation: 


1 111000 0 0 0 


;/re oot oo 0 
(13.12) sie. OO t O:O-1 1 0 
0010010101 
0001001011 


This matrix reflects the fact that the Petersen graph is the complement of the 
line-graph of Ks. The center of the smallest ball containing it is not the origin; 
translating it to the origin, we get a representation in 4-space, but with a less 
transparent matrix. 4 


Unit distance representations are not only analogous to orthogonal represen- 
tations, but in fact closely related; see Exercises |13.2| They are also special 
cases of bar-and-joint frameworks, to be discussed in Chapter 


13.2.2. Boolean variables and semidefinite optimization. Discrete opti- 
mization problems typically ask for an optimal subset of a set with given properties, 
and with maximum or minimum weight, or some other extremal property. A solu- 
tion of such a problem can be encoded as a Boolean vector. Often the constraints 
and the objective function can be expressed as quadratic functions of these vari- 
ables. If this is the case, we have a chance to use semidefinite optimization to obtain 
good bounds on the optimum. 

In the following general description, we consider the Boolean values of the 
variables as real variables 71,..., 2», where x; = 1 means TRUE and x; = 0 means 
FALSE. The quadratic equation x? = x; expresses that « is 0-1 valued. For this 
discussion, assume that the problem is a maximization problem with some (typically 
NP-hard) optimum value vopt. It is often useful to homogenize the problem; for 
this, we introduce 79 = 1. The condition on the x; becomes x? —2£;X20 = 0, and we 
also have x3 = 1. 

The symmetric matrix X = (x;x;)}';<o is positive semidefinite. There are some 
further general conditions following just from the definition of X: 


(13.13) Xoo = 1, 
(13.14) Xio —_ Xoi — X ii (¢ = i lacs ,n). 


This is not all; as an example, the following inequality is often useful (it can be 
verified by checking the possible values of 7;,2; and x,; it does not follow from 
positive semidefiniteness): 


(13.15) Xijg +X jk < Xj; + Xiz (1 Stk < n). 


To obtain a semidefinite relaxation of the problem, we express the conditions 
as linear constraints on the entries of X. This step is of course problem-specific, 
and we will discuss some important applications in the sect sections. 

Then we solve the semidefinite optimization problem with these constraints, 
and obtain an optimizer matrix X and an optimum value vsq. Since the semidefinite 
problem is a relaxation, we have vopt < Usa. How good is this approximation? And, 
equally importantly, how to get an optimum solution of the original problem from 
xX? 
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We can write X = Gram(u) for some vector labeling u. Equation 
implies that ug is a unit vector; the vectors u; (i > 0) satisfy uJ(ug—u,) = 0, 
so the vectors u;, Up —u,; and Ug form a right triangle. We can also say that u; 
lies on the Thales sphere with diameter [0, uo]. Condition translates into 
(u;—u,;)'(u;—ux) > 0, which in the geometric language means that no three of 
the labeling vectors form an obtuse triangle. 

More generally, the vector labels satisfy algebraic conditions that are formally 
similar to original quadratic constraints. For example, a constraint 7;2; = 0 implies 
Xj; = 0, which implies uju; = 0. If we are lucky and the vectors u, all belong 
to the same 1-dimensional space, then we get an optimal solution of the discrete 
problem right away. 

But in general we are not so lucky, and we need to extract and approximate 
solution from the vectors u;. The simplest method is to generate a random unit 
vector v, and define 
_ 1, ifv'u; > $v'Uo, 

7 0, otherwise. 


Geometrically, we take a random hyperplane through the center of the Thales 
sphere, and define % as the incidence vector of the set of nodes represented in 
the halfspace not containing the origin. The difficult question is, how far is this 
solution from the optimum? The analysis of a few important applications in the 
next sections shows that the answer to this question is often not easy, but the 
method is quite powerful. 

This method can augmented with several nontrivial tricks, and other extrac- 
tion methods have also been developed; and refer to and 
for more. 

While it is more common to use x; = 0 for FALSE, one could also use 7; = 
—1 to encode this. This means only a linear transformation, and sometimes one, 
sometimes the other is more useful. In the examples below, both approaches come 


up. 


13.2.3. Stable sets. We want to find a maximum stable set S in the graph 
G. We can assign 0-1 valued variables x; to the nodes, where x; = 1 means that 
i € S. The condition that S is stable means that x;x; = O for all edges ij, and 
the cardinality of S is just the sum )°, x;. It will be convenient to homogenize by 
introducing a new variable 2 = 1. Then our conditions are x? = xox; for every 
node 7, and x;x; = 0 for every edge ij. The objective function to maximize can be 
written as )ojcy #?. 

In terms of the (n+1) x (n+1) matrix X = (xjxj)?j;~9, we have 


maximize s X ii 


i€EV 
subject to A>0O, 
Xoo = 1, 
(13.16) Xio = Xu = Xu (GEV), 


Xij=0 (ij € B). 


Writing X = Gram(w), we get an orthogonal representation together with a unit 
vector Wo. The vectors v; = w? form a dual orthonormal representation with 
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handle d = wo. We know that w;, d— w; and d form a right triangle, which means 
that w; is the orthogonal projection of d onto v;, and hence |w,| = d'v;. Thus the 
value of the objective function is 


S- Xi = be |w;|? = S/(d'vi)?. 

ieV ieV ieV 
Since 0(G) is the maximum of the right hand side over all orthonormal representa- 
tions, it follows that vsq < 0(G). It is easy to check that we have equality here. So 
Vv can be defined as the most basic semidefinite relaxation of the stability number. 

A next step is to extract a reasonably large stable subset from the optimum 

solution of (13.16). We have done so in Section [11.4.1] (with a very large error, but 
as remarked there, one cannot expect to do better). 


13.3. Maximum cut 


Next we describe the application of semidefinite optimization combined with 
geometric techniques to another basic algorithmic problem, due to Goemans and 


Williamson |Goemans—Williamson 1995). 


13.3.1. The Maximum Cut Problem. A cut in a graph G is the set of 
edges connecting a set S C V to V\S, where 9 Cc S CV. The Maximum Cut 
Problem is to find a cut with maximum cardinality. We denote by maxcut(G) this 
maximum. (More generally, we can be given a weighting w: V — Ri, and we 
could be looking for a cut with maximum total weight. To keep things simple, 
however, we restrict our discussion to the unweighted case.) 

The Maximum Cut Problem is NP-hard; it was one of the first problems shown 
to be NP-hard by Karp. A natural next step is to find an “approximately” maxi- 
mum cut. Formulated in different terms, described the following sim- 
ple heuristic algorithm for the Maximum Cut Problem: for an arbitrary ordering 
(U1,---,;Un) of the nodes, we color v1, v2,...,Un successively red or blue. For each 
zi, v; is colored blue if and only if the number of edges connecting v; to blue nodes 
among v;,...,U;—-1 is less than the number of edges connecting uv; to red nodes in 
this set; otherwise, it is colored blue. Then the cut formed by the edges between 
red and blue nodes contains at least half of all edges. In particular, we get a cut 
that is at least half as large as the maximum cut. 

There is an even easier randomized algorithm to achieve this approximation, at 
least in expected value. Let us 2-color the nodes of G randomly, so that each node 
is colored red or blue independently, with probability 1/2. Then the probability 
that an edge belongs to the cut between red and blue is 1/2, and so the expected 
number of edges in this cut is m/2. 

These algorithms show that the maximum cut can be approximated from be- 
low in polynomial time with a multiplicative error of at most 1/2. Can we do 
better? The fundamental paper [Goemans— Williamson 1995], building on results 
of [Mohar-Poljak 1900|Poljak-RendI 1992) [Delorme Poljak 1993a), gives a polyno- 
mial time algorithm that approximates the maximum cut in a graph with a relative 
error of less than 13%: 


Theorem 13.8. One can find a cut with at least .878 maxcut(G) edges in polyno- 
mial time. 
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What is this strange constant? The proof below will show that its exact value 
is 2c/7, where c is the largest positive number for which 
(13.17) arccost > c(1—t) 


for -1 <¢t < 1 (Figure]13.3). It can be shown that 2c/z is an irrational number, 
of which the value .878 in the theorem is only an approximation. 


arccos x 


FIGURE 13.3. The constant c in the Goemans—Williamson algorithm 


Can we do even better? Not likely. showed that it is NP-hard 
to find a cut with more than (16/17)maxcut(G) ~ .941 maxcut(G) edges, so we 
cannot get arbitrarily close to the optimum. The constant 2c/a ~ .878 would seem 
like a “random” byproduct of a particular proof, and one would expect that the 
“truth” (the best achievable constant) is somewhere between .878... and .941..., 
perhaps some nice number like 9/10. However, it turns out that if we assume the 
complexity theoretic hypothesis called the “Unique Games Conjecture” (stronger 
than P 4 NP), then the constant 2c/7 is optimal [Khot_et_al. 2007]: no polynomial 
time randomized algorithm can produce a cut that for every graph would contain 
at least a maxcut(G) edges, where a > 2c/7. 


13.3.2. Semidefinite formulation. Let us assign a +1 variable x; to each 
node, where the sign encodes the two sides of the cut we want to find. These 
variables satisfy x? = 1. There are no instance-dependent constraints. For an edge 
ij, the difference x;—,; is zero if 7 and 7 are on the same side of the cut, and +2 
otherwise. Hence the number of edges in the cut can be expressed as 


1 
(13.18) — (a;-2;)? = qu le 


(where L is the Laplacian of the graph). In terms of the symmetric V x V matrix 
X = xx' = (xi2;)ijev, we have the semidefinite optimization problem 


(13.19) maximize qh X 
subject to X > 0 
Xi = 1 “@eEV) 
It is not hard to see that the maximum is attained, since the conditions X;; = 1 
imply that |X;;| < 1 for alli and j and so the set of solutions is compact. This 


semidefinite program is only a relaxation of the maximum cut problem: the matrix 
X has the additional property that its rank is 1. This property, however, cannot 
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be expressed as a linear condition on the matrix entries. Let y(G) denote optimum 
value in (13.19). 

We can write the optimizer in as X = Gram(u), then the vector labeling 
u consists of unit vectors, and the objective function is 


(13.20) —— u; U;. 


13.3.3. The dual problem. During our treatment of orthogonal representa- 
tions and the theta-function, we have seen that semidefinite duality has played a 
crucial role, leading not only to the technical tools of T heorem [IT.2} but also to 
duality relations between the graph and its complement (Proposition [11.4] Corol- 
lary [11.6] and Corollary [11.27). Let us conclude with describing the duality of the 
semidefinite relaxation of the maximum cut problem. It is not hard to see that the 
dual of (13.19) can be written as 


bade i 
(13.21) minimize 7 ys Zi 
tEeV 
subject to diag(z)-L>0 


In words, we want to add to the diagonal entries of —L as little as possible to make 
it positive semidefinite. Clearly both and have strictly feasible 
solutions, and both optima are attained and equal to y(G). 

In terms of the adjacency matrix, we can rewrite this as 

m 

2 
So we want to increase the diagonal entries of the adjacency matrix Ag to make 
it positive semidefinite, and want to minimize the total increase. This minimum is 
just y(G) —m/2. 

We can write a solution as diag(z)— ZL = Gram(v), then the vector labeling v 
satisfies 


1 
(13.22) 7(G) sr min{1'x: diag(x)+Ag > 0} 


1, ifijek 
(13.23) vive) es 
0, ifiek, 
and, letting v run over all vector labelings satisfying (13.23), 
1 1 
(13.24) V(G) = min 1 y (Ivi|? +deg(i)) = = ae a Ivil?. 


We sce that this is always at least m/2, as it should be. Note that v is a particularly 
nice orthogonal representation. 

An argument similar to the proof of Theorem[I1.5|shows that if the graph has a 
node-transitive automorphism group, then adding the same value in the diagonal is 
optimal. This implies that for a graph with a node-transitive automorphism group, 

mon 


13.2 LL ae 


In particular, 


1f/n n n? n? 
UK) =5(5)+ $= F Wvana) =" 
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Example 13.9. Bipartite graphs G are trivial examples for the Maximum Cut 
Problem, since all edges can be included in a cut. How is the semidefinite pro- 
gramming bound? We claim that adding the degrees deg(i) in the diagonal of the 
adjacency matrix, we get a positive semidefinite matrix. Indeed, multiplying the 
rows and columns corresponding to nodes in one color class (which does not change 
the signs of the eigenvalues), we get Lc, which is positive semidefinite. 


This implies, by (13.22), that 


m 1 mm 
<= = i) = — —= , 
(GG) < 5 a : deg (2) 5 + Nil 


Since a cut of size m can be attained, it follows that 7(G) = m, and that x; = deg(i) 
is an optimizer in (13.22). 4 


Example 13.10. Consider the line-graph H = L(G) of a simple connected graph 
G. The maximum cut of H is not hard to compute [Guruswami 1999]. A partition 
of V(H) = E corresponds to a partition E’ UE” = E. Let deg(i) and deg’ (i) 
denote the degree of node i in G and in (V, E’), respectively. Then the number 
of edges of H between E’ and E” is the number of pairs of edges e’ € E” and 
e’ € E” that share an endpoint i, and so it is )>,¢1 deg’ (i)(deg(i) — deg’ (i)). 
Clearly deg’ (i) ( deg(i) — deg’(i)) < |deg(i)?/4], and hence 


maxcut(H) < S- pce : 
icV 


This upper bound can be attained except in one case, when it can be improved by 


1 (see Exercise [13.8): 
2 
(13.26) maxcut(H) = S- ee | —€, 


; 4 
teV 


where € = 1 if G is an eulerian graph with an odd number of edges, and ¢ = 0 
otherwise. 

To estimate the upper bound 7, we assign to each edge ij the vector 144,54 € RY. 
Then we get an orthogonal representation with properties (13.23), and so 


|E(H)| | m deg(i)? 

(13.27) 98 Sar ee er a er me 
iEeV 

Equality does not hold in all cases, but it does when G has a node-transitive auto- 
morphism group. We see that for a line-graph H, the value 7(H) is a reasonable 
bound on maxcut(#); it is equal to maxcut(H), among others, when G is an euler- 
ian graph with an even number of edges, and asymptotically equal if the degrees of 
G are large. 4 


13.3.4. Extracting a cut. The semidefinite optimization problem 
can be solved in polynomial time (with an arbitrarily small relative error; let’s 
not worry about this error here). The optimum value y(G) is a polynomial time 
computable upper bound on the size of the maximum cut. 

Following the general method, we write the optimal matrix as X = Gram(u), 
to get a vector labeling with unit vectors in some space R@, for which the objective 
function is maximized. To define the cut, we generate a random hyperplane 
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FiIGuRE 13.4. A cut in the graph given by a random hyperplane 


in R®@ (its normal vector should be uniformly distributed over 9%), and define the 
two sides of the cut as the two sides of this hyperplane (Figure [13.4). 

How good is this cut? Let ij € E and let u;,u; € S"~* be the corresponding 
vectors in the representation constructed above. It is easy to see that the probability 
that a random hyperplane H through 0 separates u; and u, is (arccosuju,)/7. 
Using the constant c introduced above, the expected number of edges intersected 
by H is 


T T 
arccos U; U; 1-uu; 2c 2c 
13.28 hr areeree aaa os == t(G). 
( ) De = > ds — = 9(G) > — maxcu (G) 
jek wjEkH 


This completes the analysis of the algorithm, and proves Theorem 


Remark 13.11. Often the Max Cut Problem arises in a more general setting, 
where the edges of a the graph have nonnegative weights (or values), and we want 
to find a cut with maximum total weight. The algorithm described above is easily 
extended to this case, by using the weights as coefficients in the sums (13.18), 


and (13.25) 


Remark 13.12. It is not quite obvious how to generate a random hyperplane in 
a high-dimensional linear space. We want that the random hyperplane does not 
prefer any direction, which means the normal vector of the hyperplane should be 
uniformly distributed over the unit sphere in R”. Such a vector can be generated 
by normalizing a standard Gaussian (see in the Appendix). 

Another objection to the above algorithm could be that it uses random real 
numbers. This can be replaced by random integers from a polynomial-size set, simi- 
larly as at the end of Section [3.3.3] In fact, the algorithm can be fully derandomized 
by well established (but nontrivial) techniques. We do not discuss this issue here; 


see e.g. |Alon—Spencer 1992|, Chapter 15 for a survey of derandomization methods. 


13.3.5. Springs. A different path leading to the same algorithm reaches back 
to the idea of rubber band representations—with a twist. Suppose that we have 
somehow placed the nodes of the graph in space, conveniently on the unit sphere 
in some Euclidean space. Take a random hyperplane H through the origin. This 
partitions the nodes into two classes (Figure[13.4). We “hope” that this will give a 
good partition. 
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How justified is this hope? If the nodes are placed in a random position (say, 
their location is independently selected uniformly from the unit ball centered at the 
origin), then it is easy to see that we get half of the edges in expectation. 

Can we do better by placing the nodes in a more sophisticated way? Still 
heuristically, an edge that is longer has a better chance of being cut by the random 
hyperplane. So we want a construction that pushes adjacent nodes apart; one 
expects that for such a placement, the random cut will intersect many edges. In 
other words, we want to replace the edges not by rubber bands (as in Chapter 
[3), but by springs! Our springs will be, however, very peculiar springs, since the 
farther the endpoints are, the larger the force with which they push their endpoints 
apart. (This is certainly counterintuitive, but mathematically definable. In a small 
neighborhood of a given length, the “spring” image can be helpful, and will be used 
in Chapter |15] again.) 

Clearly we have to constrain the nodes to a bounded region, and the unit sphere 
S41 is a natural choice. (The unit ball would work just as well.) Mathematically 
speaking, we want to find a representation i++ u,; (¢ € V) of the nodes of the graph 
on the unit sphere in R@ so that the following “energy” is minimized: 


(13.29) —S 0 (uj-uy)?. 


What is the minimum of this energy? If we work in R!, then the problem is 
equivalent to the Maximum Cut problem: each node is represented by either 1 or 
—1, which means that a placement is equivalent to a partition of the node set; the 
energy is the negative of the number of edges connecting the two partition classes. 
So at least we have been able to relate our placement problem to the Maximum 
Cut problem. 

Unfortunately, the argument above also implies that for d = 1, the optimal 
representation is NP-hard to find. For d > 1, the negative of the minimum energy 
—E€opt May not be equal to the size of the maximum cut, but it is still an upper 
bound. While I am not aware of a proof of this, it is probably NP-hard to find 
the placement with minimum energy for d = 2, and more generally, for any fixed 
d. The surprising fact is that if we do not fix d (equivalently, for d = n), such 
a representation can be found in polynomial time using semidefinite optimization! 
This has been the content of our discussion in this section. 


13.4. The MAX-3SAT problem 
The Satisfiability Problem is perhaps the most central algorithmic problem in 


computer science. We have Boolean variables x71,...,2%,. A literal is a variable or 
its negation. A clause is a disjunction of literals: c = y1V---Vyx. A conjunctive 
normal form is a conjunction of clauses: ®(21,...,2%n) = C1A-:::ACm. A k-form is 


a conjunctive normal form in which every clause contains & literals. 

The Satisfiability Problem asks for finding values for the variables making 
®(x1,...,n) true (in other words, making every clause true). This is an NP-hard 
problem (in a sense, the quintessential NP-hard problem), and therefore many 
versions and special cases have been studied. It is known that the Satisfiability 
Problem is polynomially solvable for 2-forms, but NP-hard for 3-forms. 

The MAX-3SAT problem is a variant in which, given a 3-form ®, we want to 
find an assignment of values to the variables which makes as many clauses true 
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as possible. We denote by maxsat(®) this maximum number. This is, of course, 
NP-hard as well, but in this setting we may be interested in good approximation 
algorithms. 

Here we sketch an algorithm that solves the MAX-3SAT Problem within a 
factor of 7/8 of the optimum, using semidefinite optimization and the geometric 
representation obtained from it. We follow the general path outlined above: we 
consider the Boolean variables as real variables of value 0 or 1. Each clause c has a 
boolean value, indicating the truth of it: c = 1 if c is satisfied and c = 0 otherwise. 
The objective function is simply )°.c. 

As before, we use a new variable xp = 1 to make the system homogeneous, and 
want to make use of the semidefiniteness of the matrix X = xx! = (a2; )o<ij<n- 
The validity of a clause c = y; Vy; V yp can be expressed in an algebraic form: 


c= 1—-(1—-y)A1—yj)(1— ye). 
Here y; = x; if x; is unnegated, and y; = 1—2,; otherwise. Unfortunately, this 
inequality is cubic, which does not lead to linear conditions on the entries of the 
matrix X. But we are only looking for a relaxation, so we might use upper bounds 
instead of an exact expression of c. A simple bound is 


Cl Y+Yit+YR 


but this is not powerful enough. We go sort of half way, using a quadratic upper 
bound: 


(13.30) CL YH YG + Uk — Vij — VY: 
which is easy to check. There are three ways to separate one of the variables y; from 
a given clause, so every clause gives rise to three inequalities of the form (13.30). 


Defining the symmetric (n+1) x (n+1) matrix X = (xix;)?j;~9, we want to 


(13.31) maximize S°(X§,+X§; +X; - Xi -—X§) 


subject to X > 0, 
Xoo = 1, 
Xoi = Xin = Xi (l<i<n), 


where 
Xij, if i and 7 are not negated in c, 
xe = Xi — Xiz, if 7 is not negated in c but 7 is, 
a X43 —Xij, if 7 is not negated in c but 7 is, 


Xoo — Xj; — Xj; +Xij; if both 7 and j are negated inc. 


The optimum value of this semidefinite program, along with the optimizing matrix 
X, is polynomial time computable. This optimization problem is a relaxation of the 
MAX-3SAT problem: every solution of the MAX-3SAT problem yields a solution 
of (13.31) (but unfortunately not the other way around). So the optimum value of 
(13.31) is at least maxsat(®). 

The rest of the algorithm is similar to the maximum cut algorithm. We write 
X =Gram(u) with an appropriate vector labeling u in R?. Note that ug is a unit 
vector, and all the other vectors u; are contained in the sphere with radius 1/2 
about (1/2)ug (the “Thales sphere” of [0,uo]). Taking a random hyperplane H 
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through (1/2)uo, and assigning x; = 1 if u; is on the same side of H as uo, and 
x; = 0 otherwise, we get an assignment of boolean values. 

It can be proved that the expected number of satisfied clauses is at 
least tmaxsat(®): this was conjectured, and proved in some special cases in 
([Karloff-Zwick 1997), and proved completely in [Zwick 2002]. This latter proof 


is computer-assisted, and we do not reproduce it here. The constant 7/8 is best 


possible |Hastad_ 1997]. 


Exercise 13.1. How do the bounds change if we insist that different 
nodes be represented by different points of R“? 

Exercise 13.2. Let d; denote the smallest dimension in which a given graph G 
has a unit distance representation, and let dz denote the smallest dimension in 
which G has an orthonormal representation. Prove that di < dz < 4%, 


Exercise 13.3. Prove that every graph G has a unit distance representation (in 
some dimension) contained in a ball with radius 


1 1 
2 20(G) 


If the graph has a node-transitive automorphism group, then this is the smallest 
possible radius. 


Exercise 13.4. Construct the dual of the semidefinite program (13.2). Does it 
have an optimal solution? Does it have a strictly feasible solution? 

Exercise 13.5. Let Aj,...,A4, be symmetric mxm matrices. Then exactly 
one of the following alternatives holds: (i) There are 71,...,%n € R such that 
r1A,+++-+anAn = 0 and S77, eiAi #0. (ii) There exists a matrix Y > 0 such 
that A;-Y =0 (@=1,...,n). 

Exercise 13.6. Let A be an affine subspace of S. Then exactly one of the 
following alternatives holds: (i) There is an X € A, X > 0. (ii) There exists a 
matrix Y #0, Y > 0 such that either X-Y <0 for all X € CL. 

Exercise 13.7. Let Ai,..., An be symmetric m x m matrices and b;,...,bn € R. 
Then exactly one of the following alternatives holds: (i) There exist 71,...,a% € R 
such that #1A1+-+:+2nAn = 0, 5, vibi < 0, and if 0, ib; = 0 then 5°, |bi| > 0 
and y'A;y = 0 for all y € R™ with (2A; +---+a,An)y = 0. (ii) There exists a 
matrix Y > 0 such that A;-Y = }; for all i. 

Exercise 13.8. Let G be a connected simple graph. (a) If G is eulerian with an 
even number of nodes, then it has a spanning subgraph G" such that every node 
i is incident with deg(i)/2 edges of G’. (b) If G is eulerian with an odd number 
of nodes, then it has a spanning subgraph G’ such that every node except one 
i is incident with deg(i)/2 edges of G’. (c) If G is not eulerian, then it has a 
spanning subgraph G’ such that every node i is incident with either |deg(i)/2| or 
[deg(i)/2] edges of G’. 

Exercise 13.9. Show that the Maximum Cut Problem is a special case of the 
MAX-2SAT problem. 


Exercise 13.10. Find infinitely many linegraphs for which equality does not hold 


in (1527). 


CHAPTER 14 


Stresses 


We can introduce this chapter as picking up the topic of Chapter [3] studying 
frameworks in equilibrium in which the edges are rubber bands and also springs 
and rigid bars; mathematically, all that matters are the forces that act along the 
edges, which we describe as equilibrium stresses on frameworks. The physical in- 
terpretation of the edges does not matter here: in this setting, stresses (as well as 
braced stresses, to be introduced below) are simply solutions of appropriate linear 
equations. 

Physical experience helps to visualize several of the arguments. Depending on 
what carries the force along the edges, we can think of various physical models that 
can be described by the above mathematical definition. 


— We can think of an edge 77 with a positive stress as a rubber band with this 
strength. As we have done before, we think of an edge with negative stress as a 
peculiar “spring” that pushes its endpoints apart. 


— We can think of all edges as rigid bars, to get a bar-and-joint framework. 
In a resting position, such a framework will not carry any stress, but if we replace, 
say, one edge by a rubber band, then it will either move or carry a stress. Stresses 
are important tools in understanding the rigidity of the framework. 


— One can extend the previous model by taking the signs of stresses into 
account. An edge that can carry a positive stress, but not a negative stress, is 
called a cable; if it is the other way around, it is called a strut. A framework with 
its edges classified as “cables” and “struts” is called a tensegrity framework. 

From an engineering point of view, this chapter studies static properties of 
(say) bar-and-joint frameworks; dynamic properties will be the subject matter of 
the next chapter. Our main goal is to point out the many interesting connections 
between graph theory and geometry this setup leads to, so we do not attempt to 
be “practical”. 


14.1. Stresses and stress matrices 


Let us fix a vector-labeling u: V —> R? of the graph G. A function S: E > R 
will called an equilibrium stress (or simply a stress) on (G, u) if 


(14.1) S” Sij(uj—uj)=0 (for every i € V). 

JEN (i) 
Since the stress is defined on edges, rather than oriented edges, we have tacitly 
assumed that S;; = S;;. We will extend the notation to nonadjacent pairs 7,7 by 


5;; = 0 (the values 5;; play no role here; we will return to a useful way of defining 
them soon.) 
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In the language of physics, we want to keep node i in position u; (think of the 
case d = 3), and we can do that through forces acting along the edges of G. Then 
the force acting along edge 77 is parallel to u;—u,, and so wit can be written as 
Sij;(uj; —u;). Newton’s Third Law implies that 5;; = $;;. The stress on an edge can 
be zero, negative or positive. A positive stress pulls each endpoint in the direction 
of the other, a negative stress pushes them apart. Equation means that every 
node is in equilibrium. The framework is stressed, if it carries a not-identically-zero 
equilibrium stress, and stress-free otherwise. 

Let us emphasize the importance of the symmetry S;; = S;;. Without this, 
equations would be independent for different nodes i, and a nonzero stress 
would simply mean that at least one node and its neighbors are affine dependent. 

We can arrange a stress on the given framework into a matrix S € RY*", 
where Sj; = 0 if ij € E, and the diagonal entries are defined by 


(14.2) Sit = — S- Sik 


ke N(2) 


(so that each row-sum is zero). This matrix is symmetric. Combining the vectors 
u; into ad x V matrix U, the definition of a stress and the definition of the diagonal 
entries can be expressed as 


(14.3) US=0 and 1'S=0. 


If the representation is affinely full-dimensional, then the vector 1 and the rows 
of U are d+1 linearly independent vectors in the nullspace of S, giving an upper 
bound on its corank: 


(14.4) cork(S) > d+1. 


In a rubber band representation, the strengths of the edges “almost” form a 
stress: holds for all nodes that are not nailed. In this language, Exercise [3.5] 
says the following: Every rubber band representation of a simple 3-connected planar 
graph with an infinite triangular country has a stress that is 1 on the internal edges 
and negative on the boundary edges. 


FiGcurRE 14.1. (a) A framework consisting of a square with diag- 
onals and a stress on it. Simple edges represent positive stress 
values (pulling), double edges represent negative ones. (b) Stress 
on a “general” quadrilateral. The intersection point of the diago- 
nals divides them in the ratio of 6: 6 and a: 7, respectively. (c) 
Stress from a rubber band representation: 1 on the interior edges, 
negative on the boundary edges. 
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Every stress on a framework (G,u) defines a G-matrix. Clearly stresses on a 
given graph G with a given vector-labeling u form a linear space, which we can 
consider as a space Strg.u C R®. 

For this chapter and the next, we assume that the graphs we consider have no 
isolated nodes; this excludes some trivial complications here and there. 


14.1.1. Stability and the spectrum of the stress matrix. Consider a 
framework with a stress S on the edges. As before, we consider S as a symmetric 
matrix, whose diagonal entries are defined by (14.2), so that (14.3) holds. Let us 
interpret the edges as rubber bands and springs, where rubber bands correspond 
to positive stresses and springs correspond to negative stresses. The stress 5j; 
describes the strength of the edge 77. The definition of a stress means that the 
framework is in equilibrium. Is this equilibrium stable? In other words, can we 
move the nodes so as to decrease the energy of the framework? 

For any vector labeling u, and stresses S;;, define the energy as 


u) = S- Sij)u; —uyl?. 
ijEE 
If the equilibrium condition (14.1) holds, then 
(14.5) E(u) => ye Sj;\uj; —u,|? = S- S- S;;u; (uj — uj) = 0. 
ijEE i jJEN(i) 
A similar computation gives more. Suppose that we move each node i to a new 
position u;+v;; then the energy of the new framework is 


E(u+v) =, Sij\u; — us|? + > Sijlvj — vil? +2 S- Sij(vj —vi) "(uj — uy). 
Gk jek ijEE 
Using the equilibrium condition, we see that the last term vanishes: 
2 7 Sig(vj — vi) "(uy — us) 
ijeE 
= =~ a Sijvi(u y-u)+> > S- Sigv} (Wi — Uy) = 0. 
i jEN(i J ie N(j) 
Thus we get that if in ne u the framework is in equilibrium, then 
(14.6) E(ut+v) = E(v). 


It follows that the framework is in stable equilibrium if and only if E(v) > 0 for 
every assignment of vectors v; to the nodes. It is easy to check that for every vector 


xc eR’, 
a'Szr= -S Siz ( x4—2;)" 
jek 
Thus €(v) > 0 for every vector labeling v if and only if S is negative semidefinite. 
To sum up, 


Proposition 14.1. A framework (G,u) in R@ with stress matrix S is in stable 
equilibrium if and only if it is in equilibrium and S is negative semidefinite. 


Notice that if S is a stress on a framework, then so is —S; but if S 4 0 is stable, 
then —S is not stable, since the negation of a negative semidefinite nonzero matrix 
is never negative semidefinite. 
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Let S be a negative semidefinite G-matrix with zero row sums (and column 
sums). Then the corresponding energy function satisfies 


E(u) = > Sijlui—uyl? = S> Sigal? + > Sijlus?-25— Siguluy. 
i ij i ij 


The first two terms are zero, and hence €(u) > 0; furthermore, equality holds if 
and only if > j S;;u; = 0 for every node i. This is clearly equivalent to saying that 
S is a stress on (G, u). 

A further consequence of equation characterizes equilibrium positions as 
stationary positions for the energy function (cf. Exercise [14.5). 


14.1.2. Stresses on convex polygons. We study planar frameworks that 
consist of a convex polygon and some of its diagonals. 


Example 14.2. Let us consider the first nontrivial case, the complete graph Ky 
represented in the plane by the vertices of a convex quadrilateral P. This framework 
must carry a stress; if we delete an edge, and realize the remaining edges by rigid 
bars, then we get a rigid framework, so if we put back the edge as a rubber band, 
the geometry does not change. This means that the force with which this last edge 
acts must be counterbalanced by forces acting along the other edges, which yields 
a stress S. (This stress is explicitly computed in Figure [14.1}) 

The equilibrium condition implies that if S is positive on one of the 
boundary edges, then it must be positive on all other boundary edges, and negative 
on the diagonals. This implies that the stress is uniquely determined up to scaling 
(from two essentially different equilibrium stresses we could create one that is zero 
on one edge but not all by taking linear combination). By (14-4), the rank of S is 
1. It is not hard to see that the sum of stresses on the three edges incident with a 
given node is positive, so the diagonal entries are negative. This implies that the 
one nonzero eigenvalue of S is negative. Thus S is negative semidefinite. 4 


It is clear that if a framework consisting of a convex polygon and it carries a 
nowhere-zero stress, then every vertex must be incident with a diagonal (else, the 
forces acting along the two edges incident with this node could not be in equilib- 
rium). The following lemma generalizes this observation for stresses with a specific 


signature (see Figure|14.2). 


Lemma 14.3. Let (G,u) be a framework in the plane consisting of a convex polygon 
and some of its diagonals, which carries a stress that is positive on the boundary 
edges and negative on the diagonals. Then G is 3-connected. 


Proof. Suppose that G can be separated as G = G,UG»2, where V(Gi)N 
V(G2) = {a,b}. One boundary edge incident with a belongs to Gi, the other 
belongs to Gg; indeed, if both belong to (say) G1, then the whole boundary belongs 
to G, (walking around we cannot switch from G to G2 and back), and hence the 
only diagonal belonging to G» is ab, but this is not a proper 2-separation. 

Let ac be a boundary edge belonging to G1, and let F' be the force with which 
G, acts on the point u, (Figure [14.3{a)). Since a is in equilibrium, G2 acts on ug, 
with a force of —F’. The moment of F relative to the point u, must be zero, since 
the other forces acting on G2 act at up. So F is parallel to u, — ug. We may assume 
that F' points from u, in the direction of uy, (else, we can interchange the roles of 
G, and G2). 
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(a) (b) (c) 


FIGURE 14.2. Structure (a) carries a nowhere-zero stress in the 
plane, positive on the edges of the polygon, but negative on the 
diagonals (see the proof of Theorem [14.4). Structures (b) and (c) 
carry no such stress. In the case of (b), the rightmost edge could 
pull its endpoints closer without changing the length of any other 
edge. For both of (b) and (c), Lemma|14.3] implies this, since they 
are not 3-connected. 


Let e be a line through u, separating u, from the other vertices of P. Then 
all the forces Sjg(uj—Ua), where i € V(Gj), point to one side of e (due to the 
assumption about the signs of the stresses $;). This means that their sum, F, 
cannot point in the direction of uy, a contradiction. 


FIGURE 14.3. (a) A convex polygon with positive stress on the 
edges and negative stress on some diagonals cannot have a 2- 
separation. (b) Such a stress must have corank 3. 


Now we are able to prove the following important theorem on stresses on convex 
polygons {Connelly 1982): 


Theorem 14.4. Let (G,u) be a framework in R? whose nodes are the vertices of 
a convex polygon P; its edges are the edges of P and some diagonals of P. Suppose 
that G has a stress S that is positive on the boundary edges and negative on the 
diagonals. Then S is negative semidefinite and has corank 3. 


Proof. The proof consists of three main steps. 
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Claim 1. The corank of S is 3. 


We have seen already that it is at least 3. Suppose that there is a vector x in 
the nullspace of S that is linearly independent from the coordinate functions and 
the vector 1. Consider the 3-dimensional vector labeling 

iOVvi= (“) (GeV). 
Li 
Our choice of x implies that the vectors v; are not contained in a plane (Figure 
[14.3(b)). Thus their convex hull is a 3-polytope Q. The stress S will be a stress on 
the 3-dimensional framework (G, v). 

Projecting @ to the plane of the first two coordinates, we get the polygon P. 
Every vertex of P is the projection of a unique vertex of @. The polytope Q must 
have an edge e connecting vertices vz and vy that projects onto a proper diagonal 
of P (not necessarily onto an edge of G). Nodes a and b split the boundary of P 
into two arcs A and A’. 

All outside forces acting on A come from (a) diagonals connecting the interior 
of A to the interior of A’, or (b) from edges of G incident with i or j. Every 
force of type (a) tries to rotate A about the edge (ug, u,) in the same direction; 
forces of type (b) have no moment relative to this axis. Since there is at least one 
diagonal of type (a) by Lemma{14.3} the framework cannot be in equilibrium. This 
contradiction proves Claim 1. 


Claim 2. There is at least one graph G’ = (V, E’) and a stress S’ on (G’,u) such 
that S’ is positive on the edges of P, negative on the diagonal edges, and negative 
semidefinite. 


Let K,, be the complete graph on [n], where i is labeled by the vector u;. For 
any three consecutive vertices uj, Uj41, Ui+2 (2 < i < m—2), consider a stress S; on 
the complete 4-graph with vertices u,, u;, Uj+1,U;+2, that is positive on the edges 
of the quadrilateral and negative on the diagonals (as in Example (14.2). We can 
consider S$; as a stress on (/,, u). The sum of these stresses S; gives a stress that is 
positive on the edges of P, negative on diagonals connecting second neighbors along 
the arc from 2 to n, and zero on the other diagonals—except for those diagonals 
incident with node 1. We can scale these stresses before adding them up, and we 
are going to scale them so that their sum is zero on all diagonals incident with 1 
except (1,n—1). We start with S; then we scale 53 so that adding $3 kills the 
stress on (1,3). Then we add an appropriately scaled version of S4 to kill the stress 
on (1,4) etc. This way we can kill all the stresses on diagonals from 1 except on 
(1,n—1) (Figure [14.4). 

But we have to prove that we scale with positive numbers! We prove this by 
induction on i = 3,4,.... Consider the stress Sf = S2+S3+...S;-1. This is 
negative on the boundary edges of the convex polygon (uj, U2,...,U;+1), and zero 
on the diagonals incident with 1 except for (1,7). Since u; is in equilibrium, S} 
must be negative on (1,7). So to kill the stress on (1,7), we must add a positive 
multiple if $;. This proves Claim 2. 


Claim 3. S is negative semidefinite. 


We know that there is another graph G’ on V(P) whose edges are the edges 
of P and some set of diagonals of P, that has a stress S’ that is positive on the 
boundary edges and negative on the diagonals, and that is negative semidefinite. 
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FIGuRE 14.4. Making the stress zero on all but one diagonals from 
node 1. 


The matrices S, = tS+(1-—t)S’ (0 < t < 1) are stresses on the graph GUG’, of 
the type in the theorem. By Claim 1, they all have corank 3. 
Since the eigenvalues are continuous functions of the matrix, the sets 


A= {t: S, negative semidefinite} 
B={t: S; has > 4 nonnegative eigenvalues} 


are closed and their union is [0,1]. They are also disjoint, since a matrix in both 
of them would have corank 4. So one of them is empty; since 0 € A, it follows that 
1 € A, and so S = Sj is negative semidefinite. This completes the proof of the 
theorem. 


14.1.3. Stresses on convex polytopes. Let us move up to the 3- 
dimensional space. An important result about stresses is a classic indeed: it was 
proved by Cauchy in 1822. 


Theorem 14.5. The skeleton of a 3-polytope carries no nonzero stress. 


Proof. Suppose that the edges of a convex polytope P carry a nonzero stress 
S. Let G’ be the subgraph of its skeleton formed by those edges which have a 
nonzero stress, together with the vertices they are incident with. As in Figure[14.5} 
we draw an edge ij of G’ solid, if S;; > 0, and dashed if S;; < 0. Edges of P with 
zero stress are dotted. 


FIGURE 14.5. Why a polytope with an equilibrium stress cannot 
have a “quiet” node. 
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The graph G’ is planar, and in fact it comes embedded in the surface of P, 
which is homeomorphic to the sphere. By Lemma G’ has a “quiet” node 3, 
such that the solid edges (and then also the dashed edges) incident with i are 
consecutive in the given embedding of G’. This implies that we can find a plane 
& through u; which separates (in their embedding on the surface of P) the solid 
and dashed edges incident with 7. Let e be the normal unit vector of © pointing 
in the halfspace which contains the solid edges. Then for every solid edge 17 we 
have e'(u;—u,;) > 0, and for every dashed edge ij we have e'(u;—u;) < 0. This 
means that we have S;;e'(uj;—u;) > 0 for every edge ij of G’. Also by definition, 
we have $;;e'(u; —u;) = 0 for the dotted edges. Since i is a node of G’ and hence 
it is incident with at least one edge with nonzero stress, we have 


S> S,;e'(u;—u,) > 0. 
JEN (2) 
But 
S- S,j;e"(uj — uj) = eT( S- Sij(uj —ui)) =0 
) 


jEN(i JEN (i) 


by the definition of a stress, a contradiction. 


The most interesting consequence of Cauchy’s Theorem is that if we make a 
convex polyhedron out of cardboard, then it will be rigid (see Section|15.2). 


14.2. Braced stresses 


Let G be a simple graph, and let u: V — R¢ be a vector-labeling of G. We 
say that a function S: E — R is a braced stress on (G, u), if there are appropriate 
values S;; € R such that for every node 2, 


(14.7) Siguy+ S- SijUj =0 
JEN (i) 


As before, we define $;; = 0 for ij € E, to get a symmetric G-matrix 9, so that we 
have the equation 


(14.8) US =0 


(where U is the matrix whose columns give the vector-labeling u). It follows that 
if the labeling vectors span the space R?@ linearly, then the corank of the matrix S$ 
is at least d. 

Clearly ” proper” stresses are special braced stresses: if S is a stress on (G, u), 


then 
» SijUj = S- Sij) ui. 


JEN (i) JEN (2) 


The name “braced stress” can be explained by the following physical interpre- 
tation of (14.7): We add a new node 0 at position up = 0 to G, and connect it to 
every old node, to get a graph G’ (Figure |14.6). We define a new (proper) stress 
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S’ by 
i0 = Soi = Sis, (je V), 
Si, = — ba Sik, (i € VU{O}). 


kE{is}UN (i) 


Then S’ will be a stress on the edges of this representation of G’: the equilibrium 
condition at the old nodes is satisfied by the definition of S, and this easily implies 
that it is also satisfied at the new node. 


FIGURE 14.6. Rubber band representations of the octahedron 
graph, framed and braced. 


In the case when all stresses (on the original edges) have the same sign, we 
can imagine this framework as follows: the old edges are strings or rubber bands of 
appropriate strength, the new edges are rods attached to the origin, and the whole 
framework is in equilibrium. 


Remark 14.6. We can study equation from two different points of view: 
either we are given the vector-labeling U, and want to understand which G-matrices 
M satisfy it; this has lead us to “stresses”, the main topic of this chapter. Or we 
can fix M and ask for properties of the vector-labelings that satisfy it; this will 
lead us to “nullspace representations”, which will be important tools in Chapter 
(It is perhaps needless to say that these two points of view cannot be strictly 
separated.) 


Remark 14.7. Sometimes we consider a version where instead of (14.1), we assume 
that 


JEN (t) 


Such an assignment is called a homogeneous stress. Thus we have defined three 
kinds of stress matrices: “ordinary”, braced, and homogeneous. Each of these 
versions can be described by a G-matrix satisfying the stress equation US = 0. 
Homogeneous stress matrices are those that, in addition, have 0’s in their diagonal. 
“Ordinary” stress matrices have zero row and column sums. “Ordinary” stresses 
remain stresses when the geometric representation is translated, but homogeneous 
and braced stresses do not have this property. 
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14.3. Discrete Nodal Theorems 


The model for the main results in this section is the Nodal Theorem of Courant. 
Informally, this theorem states that the support of an eigenfunction of Laplacian- 
like operators on a bounded domain, belonging to the k-th smallest eigenvalue, has 
at most k connected components. We’ll see that this result about partial differential 
equations has very natural and useful analogues for graphs. 


14.3.1. Eigenvectors of well-signed G-matrices. Let us fix a connected 
graph G, let M be a well-signed G-matrix of corank d. We study the spectrum 
of M, starting with an easy observation. The Perron—Frobenius Theorem easily 
implies the following fact: 


Lemma 14.8. The smallest eigenvalue of M has multiplicity 1, and all entries of 
the corresponding eigenvector have the same sign. 


Proof. For an appropriately large constant C' > 0, the matrix CI — M is non- 
negative and irreducible. Hence it follows from the Perron-Frobenius Theorem that 
the largest eigenvalue of CI—M has multiplicity 1, and all entries of the corre- 
sponding eigenvector have the same sign. This implies that the same holds for the 
smallest eigenvalue of M. 


Turning to the second smallest eigenvalue, we may assume that it is 0 (this 
can be achieved by adding an appropriate constant to the diagonal, which changes 
neither the order and multiplicities of the eigenvalues nor the eigenvectors). 

A key lemma concerning well-signed G-matrices, which can be con- 
sidered as a discrete version of Courant’s Nodal Theorem, was _ proved 
in [van der Holst 1995]. Subsequently, various new versions and _ gener- 
alizations of this lemma were obtained; see [van der Holst et al. 1995b], 
fvan der Holst ct al. 1999], [Lovasz Schrijver 1999], [Colin de Verditre 1998), 

In its simplest form, the Discrete Nodal Theorem says that if the second small- 
est eigenvalue of a well-signed G-matriz has multiplicity 1, then the positive and 
negative supports of the corresponding eigenvector induce connected subgraphs. This 
does not remain true, unfortunately, if the eigenvalue has multiplicity larger than 
1 (and this will be an important situation for us): 


Example 14.9 (Claw). Consider the graph G = Kk 3 (the 3-star or “claw”), and 
the following well-signed G-matrix: 


-1 0 0 0 
Me -1 0 0 0 
-1 0 0 0 


This matrix has eigenvalues (—1,0,0,3), and its nullspace consists of all vectors 
of the form (0,2, y,z)', where r+y+z= 0. So for most vectors in the nullspace, 
either the positive or negative support consists of two nonadjacent nodes, and so it 
is disconnected. Note, however, that those vectors in the nullspace with minimal 
support look like (0,0,2,—a), for which both the positive and negative supports 
are connected in a trivial way. 4 
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However, these exceptional eigenvectors have a very specific form, which we 
describe in detail for the purposes of later applications. Some simpler conditions 
that exclude these exceptions are stated as corollaries. 


Theorem 14.10 (Discrete Nodal Theorem). Let G be a connected graph, let 
M be a well-signed G-matrix with one negative eigenvalue, and let0 #4 x € Ker(M). 
Let N, ={i€V: «2, =0}. Then the sets supp,(x) and supp_(x) are nonempty, 
and either 

(i) both graphs G[supp,(x)| and G[supp_(x)] are connected, or 

(ii) G[supp(a)] has connected components Hy,...,H, (38 < k < cork(M)+1), 
each of which is contained in either supp,(x) or supp_(x). Furthermore, if 2; 


denotes the restriction of x onto H;, extended by 0’s so that it is a vector in RY, 
then supp(Mz;) C N, and Mz, || Mz; for alll <i<j <k. 

For the graph structure it follows that N(H;)\V(H;) is the same subset of Nz, 
independently of 7. 


Proof. By the Perron—Frobenius Theorem, M has a positive eigenvector y 
belonging to the negative eigenvalue. From the orthogonality relation y'x = 0, it 
follows that the sets supp, (a) and supp_(a) are nonempty. Suppose that at least 


one of them, say supp,(x), induces a disconnected graph. Let H1,...,Hg and 
Ha41,..-, Hp be the connected components of supp, (a) and supp_ (a), respectively, 
then a > 2 and k > 3. Clearly z; > 0 fori <a, z; <0 fori >a, 2,...,z% are 


linearly independent, and z = )>, z;. Let Z = (z,..., 2%) € RY**. 

The main step in the proof is to understand the symmetric k x k matrix A = 
Z'MZ. (It will turn out that A = 0.) It is easy to figure out the signs of the entries 
of A. Clearly Ai; = 24 Mz; =Oforl<i<j<aanda+1<1i1<j<k, since there 
is no edge between V(H;) and V(H,) in this case. We have Aj; = 2) Mz; > 0 for 
l<i<a<j<k, since M,, <0 and 2,2, < 0 for u € V(A;),v € V(4H;) in this 
case. Furthermore, >/, z)Mz; = z}Max = 0, and hence Ajj = z} Mz; < 0 for each 
1<i<k. 

To sum up, A has nonpositive diagonal, nonnegative entries off the diagonal, 
and zero row-sums. Thus the matrix —A is diagonally dominant, and hence pos- 
itive semidefinite, which implies that A is negative semidefinite. By the Inertia 
Theorem, the number of nonpositive eigenvalues of A is not larger than the number 
of nonpositive eigenvalues of M, and hence k < d+1. 

Let 1 <i < j < a, and consider the vector w = (z}y)2i— (zy) z;- Clearly 
w #0, but w'y = 0. Hence w is a linear combination of eigenvectors belonging to 
the nonnegative eigenvalues, and so w'Mw > 0. On the other hand, using that A 
is negative semidefinite, 


w'Mw = Au(zy)? — 24a; (21 y)(z]y) + Agj(zty)? < 0. 


This implies that w'Mw = 0, and so w is a linear combination of eigenvectors 
belonging to the zero eigenvalues; this means that Mw = 0. Hence Mz; || Mz;, 
and so supp(Mz;) = supp(Mz;). 

Let U denote this common support. Since there is no edge connecting H; and 
Hy, the set U = supp(M21) is disjoint from H2, and similarly UNV(H;) = 9 for 
1<i<a. But this also implies that UNV(H,) = 0 for a+1 <j <k: else, there 
would be an edge connecting H; (j > a) and H;, showing that U = supp(Mz;) 
contains a node of Ay. 
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We note that if, in addition, M has the Strong Arnold Property, then k = 3 
in (ii) of Theorem Indeed, if k > 4, then u = (zdy)z,—(z}y)zo and w = 
(zdy) za —(z}y)z3 are vectors in the nullspace of M with disjoint supports, and no 
edge connects their supports. This contradicts Lemma[I0.26| 

The following corollary states some conditions under which the awkward pos- 
sibility (ii) can be excluded, facilitating an easier application of the lemma. 


Corollary 14.11. Let G be a connected graph, let M be a well-signed G-matriz 
with one negative eigenvalue, and let x € Ker(M), 2 £0.. Assume that one of the 
following conditions hold: 


(a) has minimal support among all nonzero vectors in Ker(M); 
(b) a has full support; 
(c) G is a 3-connected planar graph. 
Then G[supp,(x)] and G[supp_(x)] are nonempty connected subgraphs. 
Proof. It is straightforward to see that case (ii) in Theorem cannot occur 
under conditions (a) or (b). Suppose that (c) holds and alternative (ii) occurs. Since 
Vo = N(V(Aj)) \V(Aj) is a cutset, we have |Vo| > 3. Contracting each H; to a 
single node, we get a minor containing 3.3, which is impossible as G is planar. 


Some further versions of this lemma are stated in Exercises 14.11 

We can re-state the nodal lemma in a geometric form (see Figure (14.7). Let 
M be a well-signed G-matrix with one negative eigenvalue, let u: V —> R¢ bea 
nullspace representation belonging to M, and U, the corresponding matrix. The 
nullspace of M consists of all vectors of the form U's, where s € R?. 


FIGURE 14.7. Two possibilities how the nullspace representation 
of a connected graph can be partitioned by a hyperplane through 
the origin. 


Let H be a linear hyperplane in R¢. We denote by Vo, V+ and V~ the set of 
nodes represented by vectors on H, on one side of H, and on the other side of H, 
respectively. In terms of the vector x = Us in the nullspace of M, where s is a 
normal vector of H, we have Vt = supp,() etc. So it follows that Vt and V~ 
are nonempty, and “usually” induce connected subgraphs. 

In the exceptional case when one of G[V*] and G[V~] is disconnected, no edge 
connects Vt and V—. As further geometric information, there is a subspace S C H 
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of dimension d—2, and there are open half-hyperplanes T,,...,T, (k > 3) with 
boundary S’,, so that 

e all nodes in Vo, and only those, are represented in S, and these vectors 
generate S; 

e the nodes represented in T; induce a connected subgraph H;; 

e all neighbors of H; outside H; belong to Vo; 

e every node in Vo is connected either to every H;, or to none of them. 


Let us rotate the hyperplane H about the subspace S. In each position, both 
halfspaces bounded by H will contain at least one of the half-hyperplanes T;, and so 
they can never contain all of them. In each position where one of these halfspaces 
contains at least two of the half-hyperplanes TJ; we get an exceptional case. This 
gives the following geometric descriptions of the two cases covered by Corollary 
the subspace #H is linearly spanned by some of the vectors u;; H does not 
contain any of the vectors u;. 

As a further application of the Discrete Nodal Theorem, we prove a fact that 
will be important in several chapters of this book. 


Theorem 14.12. Let G be a 3-connected planar graph and let M be a well-signed 
G-matriz with exactly one negative eigenvalue. Then cork(M) < 3. 


Proof. Suppose not. Let a1, a2 and ag be three nodes of the same country 
(say, the unbounded country), then there is a nonzero vector x = (a; : i € V) 
such that Ma = 0 and x, = 2a, = a, = 0. We may assume that x has minimal 
support. 

By 3-connectivity, there are three paths P; connecting the support S of u to a; 
(i = 1, 2,3), such that they have no node in common except possibly their endpoints 
b; in S. Let c; be a node on P; next to b;, then clearly x,, 4 0 but x,, = 0. From 
the equation Mx = 0 it follows that c; must have a neighbor d; such that xa, 4 0, 
and x, and xg, have opposite signs. We may assume without loss of generality 
that x, > 0 but rg, < 0 for i =1,2,3 (Figure 14.8). 


FIGURE 14.8. Paths connecting three nodes of the unbounded 
country to the negative and positive supports 


Create a new node in the interior of the country F and connect it to a1, a2 
and a3. Using that the positive support S; as well as the negative support S_ of 
x induce connected subgraphs by Corollary [14.11] we can contract them to single 
nodes. We get a subdivision of K’3 3 embedded in the plane, which is impossible. 
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14.4. Braced stresses on polytopes 


14.4.1. Existence. In contrast to Cauchy’s Theorem 3-dimensional 
polytopes do have braced stresses. 
Theorem 14.13. The skeleton of every convex 3-polytope containing the origin in 
its interior has a braced stress that is positive on every edge of the polytope. 

Of course, we could require the braced stress to be negative on all edges. 

Proof. Let P be a convex polytope in R? such that the origin is in the interior 


of P, and let G be the skeleton of P. Let P* be its polar, and let G* = (V*, E*) 
denote the skeleton of P* (see Figure}14.9). 


FIGuRE 14.9. An edge of a polytope and the corresponding edge 
of its polar. 


Let u; and u,; be two adjacent vertices of P, and let w, and w, be the endpoints 
of the corresponding edge pq € E*, where the p is on the right of the edge 77, oriented 
from 7 to 7, and viewed from outside. Then by the definition of polarity, we have 

u; Wp u; Wy uj W) Uj Wy 1. 
This implies that the vector w, — Wz, is orthogonal to both vectors u,; and u;. Here 
u; and u; are not parallel, since they are the endpoints of an edge and the origin 
is not on this line. Hence wy — wy, is parallel to the nonzero vector u; x u;, and we 
can write 


(14.10) Wq-— Wp = Sig (Ui x u;) 
with some real numbers 5;;. Clearly S;; = Sj; for every edge 17, and our choice for 
the orientation of the edges implies that 5;; > 0. 


We claim that the values $;; form a braced stress on (G,u). Let i € V, and 
consider the vector 


Then 


where the last sum extends over all edges pq of the facet of P* corresponding to 2, 
oriented counterclockwise. Hence this sum vanishes, and we get that u; x uj = 0. 
This means that u; and ul, are parallel, and there are real numbers S;; such that 
uj = —S;,u;. This proves that the values $;; form a braced stress. 
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We call the special braced stress constructed above the canonical braced stress 
of the polytope P, and denote the corresponding matrix by S = Sp. We express 
the canonical stress as 

[Wa — Wp | Wa — Wp| 


14.11 Be = 
ca t= Vasey] Ral ay 


Hence we have a simple geometric interpretation: S;; is the length of the dual edge, 
divided by twice the area of the triangle formed by the edge and the origin. 
Taking the cross product of with w, we get the equation 
(14.12) 
' 


Wy X Wp = Siz ((ui x uy) X Wp) = Siz (uj; wp)uy — (ujwp)ui) = Si;(uj — ui). 


This shows that the canonical braced stresses on the edges of the polar polytope 
are simply the reciprocals of the canonical braced stresses on the original. 

The matrix Sp depends on the choice of the origin. We can even choose the 
origin outside P, as long as it is not on the plane of any facet. The only step in 
the argument above that does not remain valid is that the stresses on the edges are 
positive. 

The existence, and these elementary properties, of the canonical braced stress 
are nice facts in themselves, but we will need to spend some pages on the study of 
its highly nontrivial properties, after presenting some general results about spectra 
and eigenvectors we need. 


14.4.2. Spectra. Returning to the canonical braced stresses, our main goal 
is to determine the signs of their eigenvalues, and more generally, the signs of 
eigenvalues of braced stresses on convex polytopes. To this end, we relate the 
canonical braced stress matrix to volumes and mixed volumes. 

It is not hard to prove (see Exercise|14.7) that the canonical stress matrix and 
volume are related: 


| 
(14.13) vol(P*) = & ye Be. 


This simple identity is not the last word on this issue. Let P be a convex 3-polytope 
containing the origin in its interior, and let u; (i €¢ V = V(P)) be its vertices. For 
y € RY, y > 0, define 


Qy ={xER*: ujx < y; Vie VI. 


Clearly Q, is a convex 3-polytope, and 0 is in its interior. In particular, Qy = P*. 
We compute the gradient and Hessian of the function f(y) = vol(Q,). (Note that 
f is not everywhere an analytic function of y; it is not analytic at the values of y 
where @, has vertices of degree larger than 3. But the proof below will show that 
it is twice continuously differentiable.) 
Let i denote the facet of @, corresponding to 7 € V. It is easy to see that 
fa) area(F") 


(14.14) 5 f@) = ae 


Hence the outer normal of EY of length area( Fy) is 


0 


F’)u? = 
area(E")u; Oui 


y 


f(y)ui- 
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The sum of these outer normals is zero; hence we get the equation 
O 
(14.15) » Ou (y)u; = 0. 


The next important step is the following equation expressing the canonical 
stress matrix S as the Hessian of the volume of Q,: 


(14.16) 


To prove identity (14.16), let H denote the matrix on the left. Continuing from 
(14.14), we want to find 
6? area( Fi ste, )—area(F}) 


eS) a 


for 7 4%. We consider the case when t \, 0 (the case of negative values of t is simi- 
lar). If j is not adjacent to i, then Fj ,,., = Fj, and so Hi; = 0. Assume that 7j is 
an edge of P, and let pq be the corresponding edge in P*. The difference in the nu- 
merator in (14.17) is the area of a strip along the edge of Fj connecting w, and wy. 
Elementary computation shows that the width of this strip is t/(|u,| sin Z(u;, u;)). 
It follows that 
; . t|w,—w,| 
Fiste,)— Fi) = 1? + of”). 
area( 1+te, ) area( 1) u,| sin 4(uij, uj) at ( ) 
(The error term O(t?) comes from the ends of the strip.) Substituting in (14.17) 


and using (14.11), we get 


H;; = |W» — Wa 


Ju;||uj|sin (uj, uj) ag 

So the off-diagonal entries of H and S are the same, and hence H—S is a 
diagonal matrix. Differentiating with respect to y;, we get the equation )> ; Aiguy = 
0, or more compactly UH = 0. We know that US = 0, and hence U(H—S) = 0. 
Since no column of U is 0, it follows that H—S = 0. This completes the proof of 
(14.16). 

Our next goal is to relate the canonical stress matrix to the mixed volumes of 
these polytopes Q, (for the definition and properties of mixed volumes, we refer 
to Appendix [C.5] and for a presentation of greater depth, to [Schneider 1993], in 
particular Sections 5.1 and 6.3). We call a vector y € RY proper, if for every 
w € R® the faces of Qi and Q, maximizing the objective function w'x have the 
same dimension. (This can be phrased as “Q and Q, are strongly isomorphic.” ) 
If Q is simple, then every vector y that is close enough to 1 € RY” is proper. For 
two proper vectors y and z, the vector 1+sy+tz is proper, and 


Qi+sy+tz = Qi +sQy +tQ:. 
(This takes some work to check; see Exercise}14.14|) From this, we can determine 
the mixed volume V(Q1, Qy,Q-). Indeed, by the definition of the mixed volume, 
the volume of Qi+45,+¢2 can be expressed as a polynomial in s and t, in which the 
coefficient of st is 6V(Q1,Q,,Qz). On the other hand, (14.16) implies that this 


coefficient is 
2 2 


Fs DEO G1 +su+tz) = y" (Sv0l(Qe) 


dx? 


z=y'Sz. 
\e=y 


r=1 
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Hence 


(14.18) V(Q1,Qy.Qz) = ay"Se. 


After this preparation, we are able to prove one of the main results in this 
section, describing the signature of the matrix Sp of the canonical braced stress on 
a 3-polytope P. 


Proposition 14.14. If P is a 3-polytope containing the origin in its interior, then 
its canonical stress matriz S = Sp has exactly three zero eigenvalues and exactly 
one positive eigenvalue. 


Proof. From the stress equation US = 0 (where U is the matrix whose columns 
are the vertices of P), it follows that S has at least three zero eigenvalues. By 
Lemma [14.8] the largest eigenvalue of $ is positive with multiplicity 1. Thus it 
cannot be the zero eigenvalue, which means that S has at least one positive eigen- 
value. 

The difficult part of the proof is to show that S has only one positive eigen- 
value. We may assume that P is simplicial (every facet is a triangle), since every 
nonsimplicial polytope can be approximated by simplicial ones, and Sp depends 
continuously on P. (It takes some work to make this argument precise; see Exercise 
14.13). Then P* = Q; is simple. 

We invoke the Alexandrov—Fenchel Inequality (C.7) for mixed volumes: 


V(Q1, Qi, Os) = V(Qi ’ Qi, Q1)V(Q1 ’ Qy, Way): 
By (14.18), this gives that 
(y"S1)? > (y"Sy)(1"S1) 

for every proper vector y. This holds in particular for every vector sufficiently 
close to 1. Choose y = 1+ez, where z € RY is orthogonal to Sl and ¢ > 0 is 
small enough, then y'S1 = 1'S1 and y'Sy = 1'S1+e?z'Sz. Substituting and 
simplifying, using that 1'S'1 < 0 by (14.13), we get that z™Sz < 0. Since this holds 
for all vectors in a subspace of dimension n—1, it follows that S has at most one 
positive eigenvalue. 


Once we know this, Theorem|14.12}implies that S has exactly three zero eigen- 
values. 


The previous proposition is a “self-refining” result: by a relatively simple ar- 
gument (basically, repeating the last step in the proof of Theorem|14.4) it extends 
to every braced stress. 


Theorem 14.15. If P is a convex polytope in R® containing the origin in its 
interior, then every braced stress matrix with positive stresses on its skeleton has 
exactly one positive eigenvalue and exactly three zero eigenvalues. 


Proof. It follows just as in the proof of Proposition [14.14] that every braced 
stress matrix Q with positive values on the edges has at least one positive and at 
least three zero eigenvalues. Furthermore, if it has exactly one positive eigenvalue, 
then it has exactly three zero eigenvalues by T heorem [14.12] 

Suppose that @ has more than one positive eigenvalues. The braced stress 
matrix Sp has exactly one. All the matrices Q; = tSp+(1—t)Q are braced stress 
matrices for the same vector labeling and with positive values on the edges, and 
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hence they have at least four nonnegative eigenvalues. Since the eigenvalues are 
continuous functions of the matrix, there is a smallest number t > 0 for which Q; 
has at least 5 nonnegative eigenvalues. Then 1; must have exactly one positive 
and at least four zero eigenvalues, contradicting Theorem [14.1] 


Remark 14.16. Most of our discussion in the last section extends to higher di- 
mensional polytopes. The canonical stress matrix of the skeleton of a polytope can 
be defined in every dimension. For us, however, only the 3-dimensional case will be 
relevant. 


Exercise 14.1. Prove that a stress matrix of a framework with at least two 
different vector labels is never well-signed. 


Exercise 14.2. Suppose that a framework has a stress. (a) Prove that applying 
an affine transformation to the positions of the nodes, the framework obtained has 
a stress. (b) Prove that applying a projective transformation to the positions of 
the nodes, so that no node is mapped onto an ideal point, the framework obtained 
has a stress. 


Exercise 14.3. Let P be a convex polyhedron in R?, and let G be a planar graph 
embedded in the surface of P, with straight edges. Prove that every stress on G 
is a sum of stresses, each of which is supported on edges of G on a single face of 


P |Whiteley 1984). 


Exercise 14.4. Let S = {u,...,un} C R*. We say that S is in isotropic 
position, if }>, u; = 0 and do, uju) =I. 

(a) Prove that every finite point set not in an affine hyperplane can be trans- 
formed to isotropic position by an appropriate affine transformation. 

(b) Let the nodes of Kn be labeled by a set {u1,...,Un} C R® in isotropic 
position. Define $;; = uju;+1/n (i #7). Prove that (with appropriate diagonal 
entries) S defines a stress matrix on (Kn, u), and it is negative semidefinite. 


Exercise 14.5. Let (G,u) be a rubber-band-and-spring framework, and let S 
be a G-matrix. Prove that S is a stress on (G,u) if and only if changing each 
position u; by at most ¢, the energy changes by O(c”). Use this assertion to give 
an alternate argument for the conclusion in the proof of Theorem [14.4] 


Exercise 14.6. Let P be a simple 3-polytope (every vertex has degree 3) contain- 
ing the origin in its interior. Prove that the braced stress on its edges is unique 
up to scaling. 


Exercise 14.7. Let P be a polytope and let S be the canonical stress on P. (a) 
Prove that the volume of the convex hull of the face i of P* and the origin is 


§ 2»; Sij- (b) Prove identity (14.13). 


Exercise 14.8. Let P be a convex polygon in the plane, containing the origin 
in its interior. (a) Prove that its edges carry a braced stress that is negative on 
the edges. (b) Find a formula for this stress. (c) Prove that the stress matrix has 
exactly one negative eigenvalue. 


Exercise 14.9. We have seen that the square with diagonals with the stress 
shown in Figure a) is stable with respect to the positions of the vertices. It 
is however, very unstable with respect to the stress: if we consider the edges as 
rubber bands and springs, and change any of their strengths arbitrarily little, it 
will either collapse or fly off to infinity. If the diagonals are bars (or struts), then 
changing the strength of any of the rubber bands (representing the edges of the 
square) by a little amount, changes the equilibrium position only a little. 
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Exercise 14.10. Let G be a connected graph, let M be a well-signed G-matrix 
with exactly one negative eigenvalue, and let « € RY be a vector satisfying Max < 
0. Then supp™ (zx) is nonempty and induces a connected subgraph of G. 


Exercise 14.11. Let G be a connected graph, let M be a well-signed G-matrix 
with exactly one negative eigenvalue, and let « € RY be a vector satisfying Ma < 
0. Then supp™ (a) is nonempty, and either it induces a connected subgraph of G 
or there is a subset S C V, such that G\ S has at least two connected components 
Gi,...,Gn, and there exist nonzero, nonnegative vectors z1,...,2r,y € RY such 
that supp(z:) = S;, supp(y) C S, May = Map =.--- = Ma, y, and « = 
Yo, aizi, where }>; a; > 0 and at least two a; are positive. 


Exercise 14.12. Let P be a convex polygon in the plane, such that the origin 
is contained in its exterior, and no edge is collinear with the origin. (a) Prove 
that its edges carry a braced stress. (b) On which edges is it negative? (c) Prove 
that (switching the sign of all stress values if necessary) the number of negative 
eigenvalues is equal to the number of edges of P visible from the origin. 
Exercise 14.13. Let P’ (ft = 0,1,2,...) be convex 3-polytopes with vertices 
vi,...,v4, containing the origin in their interiors. Suppose that vi — v? as 
t — oo for every 1 <i<n. Prove that Sp: + Spo entry-by-entry. 

Exercise 14.14. Let P be a 3-polytope containing the origin in its interior, and 
let y and z be proper vectors in the sense of Section [14.4.2] Prove that y+z is 
proper, and Qy+2 = Qy+Qz. 

Exercise 14.15. If a braced stress on a 3-polytope is nonnegative on every edge 
(but may be zero), then the corresponding stress matrix has at most one positive 
eigenvalue. 
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CHAPTER 15 


Rigidity and Motions of Frameworks 


After the previous, somewhat special and technical chapter, we turn to a practi- 
cally important as well as graphically inspiring topic, namely rigidity of frameworks. 
We replace the edges of a graph by rigid bars (instead of rubber bands and springs 
as before). This way we obtain a physical model related to rubber bands, but more 
important from the point of view of applications. This topic has a vast literature, 
dealing with problems in architecture, engineering, robotics, and (replacing the ref- 
erence to rigid bars by distance measurements) in the theory of sensor networks. 
We have to restrict ourselves to a few basic results of this rich theory; see e.g. 
or for more. 

In the introduction to the previous chapter we have seen a whole arsenal of 
physical gadgets that we can imagine in place of the edges: rubber bands, ca- 
bles, bars, struts and springs. The difference between these with respect to static 
properties was little (the sign of the stress). We could study dynamic properties of 
structures composed of any combinations of these, but we will restrict our attention 
to bar-and-joint frameworks. 

We will look at these frameworks with two (slightly) different goals in mind. 
From one aspect, we specify the locations of the nodes in space (which determines 
the lengths of the edges geometrically). We then ask whether the resulting frame- 
work is rigid. This is the main use of these ideas in architecture: we specify the 
geometry of the framework and the rigidifying beams, and then want to make sure 
that the framework does not collapse; we want to compute the forces acting along 
the beams etc. From the other aspect, we are given the nodes and edges, along 
with the lengths of the edges, and then we are interested in all possible ways this 
framework can move. This aspect was very important in the 19th century, in the 
design of mechanisms (see Example (15.1); these days, it is extensively studied for 
further applications, among others, in the design of robot arms. 


Example 15.1 (Peaucellier—Lipkin linkage). Perhaps the first time when 
frameworks and their motions received a lot of interest came after the discovery 
of the steam engine and other mechanical devices in the 18*" and 19 century. 
One important goal was to guide a point back and forth on a straight line segment. 
Figure[15.1] shows one of the first solutions of this problem, the Peaucellier—Lipkin 
linkage. Note the sophistication needed for the solution of this simply sounding 
problem. 4 


A graph G with a vector-labeling in R®@ will be called a framework, where the 
label is considered as the “position” of the node. A graph with an edge-length 
assigned to each edge will be called a linkage. Every framework (Gu) defines a 
linkage (G, 2); we can say that to get a linkage from a framework we forget the 
positions of the nodes and only remember the lengths of the edges. The linkage 
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FIGURE 15.1. The Peaucellier—Lipkin linkage. The black joints 
are nailed, the white joints are free to move. The lengths of the 
bars satisfy AB = BC = CD = DA and QB = QD. Nodes A 
and C are inverses relative to the fixed circle centered at Q with 
radius \/QB?— AB? (prove!). As A moves along a circle through 
the center of the inversion, its inverse C’ moves along a straight 
line. 


(G, 0) is realizable in R¢ if it is defined by a d-dimensional linkage, i.e., there is 
a vector-labeling u: V — R®@ so that ¢(ij) = |u;—u,| for every edge ij. The 
framework (G,u) is called a realization of the linkage (G, ¢). 

Most of the time we assume that the framework is not contained in a lower 
dimensional affine subspace, in which case we call it full-dimensional. Two frame- 
works (G,u) and (G,v) with the same underlying graph G in d-space are called 
congruent, if there is a congruence (isometric transformation) of the whole space 
that maps u,; onto v; (¢ € V). Basically, we do not want to distinguish congruent 
frameworks; but a warning is in order that congruence will be a nontrivial issue in 
some considerations. 

Sometimes we want to “factor out” congruences. This can be defined as iden- 
tifying congruent realizations of the same linkage, but a more explicit description 
of the resulting factor space will be useful. Let us assume that the framework is 
full-dimensional, and consider an ordered set of nodes, say S' = (0,1,...,d), whose 
positions are affinely independent. Shift ug to the origin; rotate the space about 
0 so that uy is moved to the positive half of the x,-axis; then rotate the space 
about the x; axis so that ug is moved to the upper halfplane of the (1, x2)-plane 
etc. We end up with a vector labeling that is congruent to the original, and for 
0 <i<d the i-th coordinate of u; is positive, and its later coordinates are 0. We 
call this procedure fitting S to the coordinate system. This way we may assume 
that altogether d+ (d—1)+---+1 = Ce) coordinates of the node positions are 
zero. (This number is the same as the dimension of the manifold of congruences of 
the Euclidean d-space.) 


15.1. Versions of rigidity 


There are several versions of the notion of rigidity of a framework (G,u) 
(u: V>R?). 

e Is there a “proper motion” or “flexing” in the sense that we can specify a 
smooth orbit u,(t) (¢ € [0, 1]) for each node i so that u;(0) = u;, and |u;(t) — u,(t)| 
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is constant for every pair of adjacent nodes, but not for all pairs of nodes? If no 
such motion exists, we say that the framework is locally rigid or simply rigid. 
Structures which are not locally rigid will be called flexible; sometimes they are 
also called mechanisms. 

We have to make a technical remark here: We have defined a flexing as a 
smooth motion of the nodes, but it would be natural to be more general and 
allow continuous but not differentiable motions, or to be more restrictive and allow 
only those motions that are analytic functions of time. As a very weak condition, 
we could ask for the existence of a noncongruent realization of the same linkage 
arbitrarily close to the original. It turns out that all these notions of flexibility are 
equivalent (see e.g. [Connelly 1993}). 

e Is there a different realization of the same linkage: another representation 
v: V +R? such that |v;—v,| = |u;—u,| for every pair of adjacent nodes, but 
not for all pairs of nodes? (The representation v may or may not be obtained from 
the representation u by a continuous motion preserving the lengths of the edges.) 
If the framework has no different realization, it is called globally rigid. It is clear 
that this implies local rigidity. Further (non)-implications between these notions 
are illustrated in the examples below. 

e Suppose that a framework is rigid in R%; does it remain rigid, if we em- 
bed it (isometrically) in a higher dimensional space? We say such a framework is 
universally rigid. A single triangle in the plane has this property; two triangles 
attached along an edge form a rigid framework in the plane but not in 3-space. 
Universal rigidity is in various ways closely related to global rigidity; we refer to 


Gortler—Thurston 2014) and |Connelly—Gortler-Theran 2016a]. 


e It may happen that we can assign velocities to the joints so that they do 
not seem to stretch or shrink any edge, but such velocities cannot be continued 
to an actual motion (unless they correspond to a rigid motion). We call such an 
assignment of velocities an “infinitesimal motion”, and if no such motion exists, we 
say that the framework is infinitesimally rigid. 

We can give an engineering interpretation of infinitesimal motions: Suppose 
that we move each node 7 by a (small) distance of at most ¢, so that this is not a 
rigid motion of the framework. Trivially, the length of any edge i7 changes by O(e). 
It can happen that for such a nonrigid motion the length of each edge changes by 
O(e?) only; in this case, we cannot discover “in first order” that the length changes 
at all. From a practical point of view, this may be a problem, since in practice the 
bars are never totally rigid, and they may not hold against an infinitesimal motion. 


Example 15.2. Consider a complete graph K,,. Trivially every representation of it 
is globally rigid, and also locally rigid. However, it may have infinitesimal motions 
other than congruences: if all nodes are contained in an affine hyperplane, assign 
to each node an arbitrary velocity vector orthogonal to this hyperplane. 4 


Example 15.3. Have a look at Figure|15.2| which shows 2-dimensional realizations 
of the same graph, rigid in one sense but not in the other. ¢ 


Example 15.4 (Triangular Prism IT). Let us return to the triangular prism 
in Example represented in the plane. From we get that dim(Inf) — 
dim(Str) = 2-6—9 = 3 = dim(Rig) (assuming that not all nodes are collinear in 
the representation), and hence a representation of the triangular prism in the plane 
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(a) (b) (c) 


FIGURE 15.2. Structure (a) is infinitesimally rigid and locally rigid 
in the plane, but has a different realization (b), and so it is not 
globally rigid. Considering this as a motion in 3-space, we see 
that this framework is not universally rigid. Structure (c) has no 
other realization, so it is locally, globally and universally rigid, but 
it is not infinitesimally rigid, since vertical motion of the middle 
point is an infinitesimal motion. (Note that motion by ¢€ results in 
stretching the edges by O(e?). 


has a noncongruent infinitesimal motion if and only if it has a stress. This happens 
if and only if the lines of the three edges connecting the two triangles pass through 
one and the same point (which may be at infinity; Figure [15.3] left and right). 
Not all of these representations allow flexing, but some do. In the first, fixing 
the dark nodes, the white triangle can “circle” the dark triangle. This framework 
has a stress (can you find it?). The second is infinitesimally rigid and stress-free, 
but it is not globally rigid (reflect the white triangle in a symmetry axis of the black 
triangle); the third has an infinitesimal motion (equivalently, with this number of 
edges, it has a stress, as we will see), but it is globally rigid (and so it has no 
flexing). 4 


FIGURE 15.3. Rigid and nonrigid representations of the 3-prism 
in the plane. 


15.2. Infinitesimal motions 


We will continue with the exact definition and study of infinitesimal rigidity, 
which was perhaps the most awkward to describe, but easiest to handle. 

Suppose that the nodes of the graph move smoothly (in continuously differ- 
entiable way) in d-space; let u;(t) denote the position of node i at time t, where 
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u; = u;(0) is the starting position. The fact that the bars are rigid says that 
for every edge ij € E, the edge-lengths ¢;; = |u;—u,| are fixed. Squaring these 
equations leads to polynomial equations 


(15.1) (u,(t) —u,(t)) '(us(t)—u,(t)) = @,. 


Differentiating, we get 


(15.2) (u,(t)—u,(¢)) "(a;(t)—a;(t)) = 0. 


In particular, 
(15.3) (u;—u,) "(u;(0) —a,(0)) =0. 


This motivates the following definition. Given a vector-labeling u: V > R?¢ of G, 
amapv: V +R‘ is called an infinitesimal motion of the framework (G, u), if the 
equation 


holds for every edge ij € E. The vector v; can be thought of as the “velocity” of 
node 7. 

Infinitesimal motions form a linear space Inf = Infgj C R¢XxV There are some 
trivial infinitesimal motions, called (infinitesimal) rigid motions or congruences, 
which are velocities of smooth families of congruences of the whole space. In this 
case, holds for every pair of nodes. Every infinitesimal congruence is the sum 
of two kinds of special solutions of equations (see Exercise[15.3): we can have 
v; = v; for all i and j (geometrically, this means translation of the framework), 
and v; = Au;, where A € R?*@ is a skew symmetric matrix (this corresponds to 
applying the smooth family of orthogonal transformations). 

Infinitesimal congruences of a framework form a subspace Rig = Rigg, C Inf. 
By the remarks above, Rig consists of all systems of velocities of the form 


(15.5) v; = Au; +y, 


where A is a skew symmetric matrix and v € R¢. If the node positions are not 
contained in an affine hyperplane, then this representation is unique. This implies 
that in this case dim(Rig) = d+ () = ewe 

In the language of differential geometry, congruences of the d-space form a 
manifold of dimension ear and infinitesimal congruences form the tangent space 


of this manifold, which is a linear space of the same dimension. 


15.2.1. Infinitesimal and finite motions. We say that the framework is 
infinitesimally rigid, if Inf = Rig, i.e., every infinitesimal motion is an infinitesi- 
mal congruence. The argument above, motivating the definition of infinitesimal 
motions, suggests the following: 


Lemma 15.5. If a framework is flexible, then it is not infinitesimally rigid. 


This looks trivial, since it would seem that if u(t) (0 < ¢ < 1) is a proper 
flexing, then v; = u;(0) gives an infinitesimal motion by (15.3). However, there is 
technical difficulty here: (15.3) may be a trivial equation if all the velocities u,;(t) 
are zero at t = 0. It could also happen that u(0) is an infinitesimal congruence, 
even though the flexing is proper (see Example[15.6). 
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FIGURE 15.4. Left: a proper flexing whose velocities at time 0 
form an infinitesimal congruence. Right: A proper flexing where 
velocities at positions at times converging to 0, when normalized, 
can converge to an infinitesimal congruence, but also to a proper 
infinitesimal motion, depending on when the framework is ob- 
served. 


Example 15.6. Consider a smooth proper flexing (G, u(¢)) as above, but let us 
modify it by slowing down and translation: we consider (G ; v(t)), where 


vi(t) = u;(t?) +tv 
with some fixed vector v. Then 
Vv; (0) =Vv, 


which is an infinitesimal congruence (Figure [15.4] left). 

Perhaps we should construct an infinitesimal motion not as the velocities at 
time 0, but as a limit of velocities at times converging to 0, appropriately scaled? 
The example on the right of Figure as a framework in 3-space, should be 
another warning. We can think of this as a pendulum circling its vertical position 
and slowing down, starting at time —1. The geometry and timing are arranged so 
that the k-th arc has length ~ 4~*, and it is traversed in time 2~*, so the velocity 
tends to 0, which is attained at time 0. (If you wish, you can continue to positive 
times just reversing the process.) 

If we look at velocities when the pendulum is closest to us, then we see velocities 
of a noncongruent infinitesimal motion. However, if we look at the velocities when 
the pendulum is (say) in a leftmost position, then we see the velocities of a rigid 
motion. How to choose the “right” moments in time? 4 


Proof of Lemma Our first step is to modify the flexing (G, u(t)) in 
order to eliminate rigid motions as much as possible. Fixing a time ¢, let u; = u,(t) 
and U; = u;(0), and consider (G, u) and (G, 1) as two different frameworks. We fix 
(G,U), and rigidify (G,u), but let it float freely. Connect node u,; to node U; by 
a rubber band for each node 7, and let the resulting structure find its equilibrium. 
In other words, apply an isometric transformation to (G,u) so that the energy 
>>, \ui—U;|? is minimized. 

The forces keeping (G, u) in equilibrium (as a rigid body) are the vectors v; = 
v(t) = u;—U;, acting at the points u;. By classical physics, 


(15.6) So vi =0, 


iEeV 
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and 
(15.7) Sv) Au; = 0 

ieV 
for every skew symmetric matrix A € R¢¢. Also note that since (G,u) and (G, @) 
are the same as linkages, we have |u;—U,|* = |u;—u,|? for every edge ij. This 
translates into the equation 
(15.8) 2(u; —U;) "(vi —v;) = —|v;-v;|?. 


Let w, = maxjey |v;|. If (G,t) is not isometric to (G,u), then uw, > 0, and 
the vectors (1/w;)v; have length at most 1. Let us select a sequence of such times 
converging to 0, then w, — 0. By selecting a subsequence, we may assume that 
(1/wz)vi(t) > w; for some vectors w;. Clearly max; |w;| = 1 and so w £4 0. 

We claim that w is an infinitesimal motion of (G,w). Indeed, implies 
that 


1 1 1 
BT 2 
u; —U; Vv; vjJ= vi-v;|" = O(w 
(Hi —4))" (vi - vs) =~ lve vil? = Own), 
and so in the limit we have 

We have to argue that w is not an infinitesimal congruence of (G,U). Suppose 
it is, then there is a vector v € R@ and a skew symmetric matrix A € R?*4 such 
that w; = v+ Au;. We may assume that }7, u; = 0, since we can choose the origin 
at the center of gravity of the points 0;. Scaling by uw; and going to the limit, we 


see that the vectors w; satisfy the equilibrium conditions (15.6) and (15.7). The 
first condition gives us that 


0= So wi = nv+ )— Au; = nv, 
ieV ieV 
so v = 0 and w; = Au;. Applying the second condition with the same matrix A, 


we get 
0= 5 > w/ Au; = 5_u/A‘Au, = 5 |Aui|?. 
ieV ieV ieV 
This implies that At; = 0 for every i, and so w; = 0. But we know that w 4 0, a 
contradiction. 


The converse of this lemma is not true in general, as Figure shows. How- 
ever, the converse does hold for stress-free frameworks: 


Lemma 15.7. If a stress-free framework is not infinitesimally rigid, then it is 
flexible. 


Proof. Let (G,u) be a stress-free framework in R?, and let v be a non-rigid 
infinitesimal motion. Since v is not a rigid infinitesimal motion, there is a pair (a, b) 
of nonadjacent nodes for which (ug —uy)'(va — vp) 4 0. We may assume that 


(15.9) (ug —u,) "(va —va) = 1. 


Equations give linearly independent equations for v, and is linearly 
independent from them, since the other equations have zero right hand side. In the 
solution space of and (15.9), we can fix some entries of v to zero to have a 
unique solution. By Cramer’s Rule, the remaining entries of v can be expressed as 
rational functions of the entries of u, and these rational functions will satisfy 
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and (15.9) in a neighborhood of u. Scaling up with the common denominator, we 
get polynomials w;(u) in the entries of u such that 


(xi —x,)"(wa(x) —w;(x)) =0 
identically in x € R¢*” for every edge ij € E, and 


(Xa — xb) '(Wa(x) —Wws (x)) #0 
if x is in a neighborhood of u. 
Consider the differential equation system 
g 
dt 
Since w(x) is a continuous function of x, this system has a solution in an interval 
0<t<T. This defines a nonrigid motion of the graph. Indeed, for every edge 77, 
d Ty. ‘ 
<li (t) — xy (t)|? = (xi(t) x, (€)) (X(t) — X;(2)) 


= 
= (xi(t)—x; (4) (wi(x(¢)) —wy(x(4))) = 0, 

so |x,;(t)—x,(t)| remains constant. On the other hand, this is not a rigid motion, 

since 


x;(t) = wi(x(t)), x(0) =u. 


— [Xa (t) — x0 (#)|? |e=o= (Wa — uv) "(wi(u) — w;(u)) 4 0, 


so |Xq(t) —x»(t)| cannot remain constant. 


15.2.2. Infinitesimal motions and stresses. Which subsets of the equa- 
tions in (15.4) are linearly independent? Let ;; (ij € E) be multipliers such that 
combining the equations with them we get 0. Considering the coefficient of v;, this 


means that 
S- Aij (uy — uz) = 0 
JEN (i) 
for every node i. In other words, 4 is a stress! 

Recalling the notion of transversality, and that equations were obtained 
by differentiating equations (15.1), we can assert that a solution of is 
transversal if and only if the framework (G,u) is stress-free. This means that 
stress-free frameworks can be studied by similar methods as transversal orthogonal 
representations. For example, the Implicit Function Theorem can be applied to 
show that if a linkage (G,¢) has a stress-free realization, then so does every linkage 
(G,’) for edge-lengths ¢' sufficiently close to &. 

There is a certain “dual” to this observation about linear dependencies between 
the equations defining infinitesimal motions. What are the linear dependencies 
between the equations defining a stress (where the values Sj; are considered 
as unknowns)? Combining the d equations belonging to the same node i means 
taking inner product with a vector w;. Combining all results in one equation, we 


obtain 
S- a Sij(u jy)" Wi wi = 0, 


i€V GEN(i 
or 
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If this is a linear dependency between the stress equations, then the coefficient of 
every 5;; must be zero. Thus w is an infinitesimal motion! 

Let us re-write the basic equations in a form that is sometimes more 
useful. We combine the velocity vectors v; into a single vector v € R"¢, then 
(15.4) provides a system of homogeneous linear equations for the entries of v. The 
matrix R = Rou € R™*"4 of this system is defined for an edge ij, a node k and a 
coordinate 1 < v < d by 


Uiv — Ujv, if k =i, 
(15.10) Rijpv = Ujy — Uiv, if k => J; 
0, otherwise. 


We call R the rigidity matrix of the framework (G,u) (Figure|15.5). 
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FIGURE 15.5. A framework in dimension 2 and its rigidity matrix. 
The framework has 5 nodes and 7 = 2-5—3 edges. It has both a 
nonzero stress and a nonrigid infinitesimal motion. 


Infinitesimal motions (combined into single vectors v € R"?) form the right 
null space of R, while stresses on (G,u) form the left null space of R. This implies 
that 


(15.11) rk(R) = dn—dim(Inf) = m—dim(Str). 


From this we see that if a framework spanning R®@ is stress-free, then m < 
dn— Ce. and if it is infinitesimally rigid, then m > dn— Cel. In the (most 
interesting) case when m = dn— aw the framework is infinitesimally rigid if and 
only if it is stress-free. Such frameworks are minimally infinitesimally rigid as well 
as maximally stress-free (corresponding to the subsets of edges that are bases of 
the row space of R(K,,u)). 

There is another useful meaning of the rigidity matrix. Fix a graph G, and 
consider the squared-edge-length map f = fg: R¢*” > R® defined by 


(15.12) f(u) = (ju;—uj|?: ij € EB). 
It is easy to compute that the Jacobian Df(u) of f is just the rigidity matriz Rev. 


15.2.3. Rigidity of 3-polytopes. To conclude this section, let us illustrate 
the connection between stresses and infinitesimal rigidity by an important example. 
Let G be the skeleton of a convex 3-polytope P, with the representation given by the 
positions of the vertices. Cauchy’s Theorem [14.5] tells us that the space of stresses 
has dimension 0, so we get that the space of infinitesimal motions has dimension 
dim(Inf) = 3n—m. We know that dim(Inf) > (3) = 6, and hence m < 3n—6. Of 
course, this simple inequality we know already (recall (2.2)). It also follows that if 
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equality holds, i.e., when all facets of P are triangles, then the space of infinitesimal 
motions is 6-dimensional; in other words, 


Proposition 15.8. The framework consisting of the vertices and edges of a convex 
3-polytope with triangular facets is infinitesimally rigid. 


If not all facets of the polytope P are triangles, then it follows by the same 
computation that the skeleton is not infinitesimally rigid. However, if the facets are 
“made out of cardboard”, which means that they are forced to preserve their shape, 
then the polytope will be infinitesimally rigid. To prove this, one can put a pyramid 
over each facet (flat enough to preserve convexity), and apply Proposition [15.8] to 
this new polytope P’ with triangular faces. Every nontrivial infinitesimal motion 
of P preserving the shape of the facets would extend to a nontrivial infinitesimal 
motion of P’, but we know already that P’ has no such motion. 

Proposition [15.8] does not remain true if we only assume that the graph is iso- 
morphic to the skeleton of a convex 3-polytope with triangular facets; see Exercise 
[15.6] for a classical example [Bricard 1897]. It even fails to hold when the framework 
consists of the vertices and edges of a simple polytope (the surface of the polytope 
is homeomorphic to the sphere), as shown by a more involved counterexample 


Connelly 1977). 


15.3. Generic frameworks 


In Chapter |3] we have met vector-labelings that are in general position; in 
Chapter [10] another related condition, transversality was introduced. Here we need 
a further nondegeneracy condition: we say that a vector-labeling is generic, if all 
coordinates of the representing vectors are algebraically independent over Q. 

This assumption is more restrictive than general position, and it is usually an 
overkill, since we could always replace it by “the coordinates of the representing 
vectors do not satisfy any algebraic relation that they do not have to, and which is 
important to avoid in the proof”. The hypothesis of general position will be used in 
the next section on global rigidity again. Transversality is equivalent to the basic 
condition of stress-freeness, and so both of these related but weaker notions have a 
role to play in rigidity theory. 

In a generic framework, all entries of the vector labels are algebraically indepen- 
dent, by definition. However, while isometric transformations of the whole space 
do not change stress-freeness, and do not change the edge-lengths, they may create 
algebraic dependencies between the coordinates. For example, if we fit a set of d+1 
nodes to the coordinate system, then out of dn coordinates, (“5') become zeros, 
which means that we are left with dn — ee algebraically independent coordinates. 

We say that a graph is generically stress-free in R®, or simply stress-free in R@, 
if every generic representation of it in R®@ is stress-free; similarly, we say that G is 
generically rigid in R¢, if every generic representation of it in R@ is infinitesimally 
rigid. It is enough to check these properties for a single generic representation: if one 
generic labeling is stress-free or infinitesimally rigid, then so is every other generic 
labeling. More generally, the following fact was observed by Gluck [Gluck 1974]: 


Proposition 15.9. If a graph G has a vector labeling in R@ that is infinitesimally 
rigid [stress-free], then G is generically rigid [stress-free] in R¢. 


Proof. Both infinitesimal rigidity and stress-freeness can be expressed in terms 
of the nonvanishing of certain determinants composed of the coordinates of the 
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representation. If such a determinant vanishes for one algebraically independent 
choice of the coordinates, then it vanishes for every choice. 


The assumption that the coordinates are algebraically independent numbers 
seems very far from any real-life situation; it is quite technical and unverifiable, 
and for this reason it seems useless. However, another way of handling the generic- 
ity assumption brings genericity quite close to applications: this is to choose inde- 
pendent random values for the coordinates uniformly from an interval (or indeed, 
from any distribution that is absolutely continuous with respect to the Lebesgue 
measure, like Gaussian). We call the vector labeling obtained this way a random 
vector labeling of G. The random labeling is generic with probability 1, and we can 
re-state our results as probabilistic statements holding with probability 1. (Mathe- 
matically, the definition in terms of algebraic independence yields slightly stronger 
results. ) 

By this argument, we can state: 


Proposition 15.10. If G is generically rigid [stress-free] in R¢, then a random 
vector labeling of G is almost surely infinitesimally rigid [stress-free] in R@. 


15.3.1. Stress-free and rigid graphs in the plane. Whether a graph is 
generically rigid or stress-free is a purely combinatorial property, and one would like 
to obtain a purely combinatorial characterization of these properties. It is surprising 
that this basic question is only solved in the planar case, which we present here. 
We start with stress-freeness. 

It is clear that if a framework (G,u) has more than 2n—3 edges, then it cannot 
be stress-free (recall (15.11)). We can apply this observation to induced subgraphs: 
if some subset of k > 2 nodes spans more than 2k—3 edges, then the framework 
cannot be stress-free in the plane. So a natural criterion for a graph G to have a 
stress-free representation is the following: 


(GL) Every set of k nodes (k > 2) spans at most 2k—3 edges. 


Note that this condition implies that the graph has no multiple edges. The 
necessity of this criterion was recognized already in the 19-th century, and it was 
stated as a “rule of thumb” for the existence of a stress-free realization in text- 


books |Foppl 1880]. The sufficiency was proved by |Pollaczek-Geiringer 1927|, and 
rediscovered by [Laman 1970]. We call (GL) the Geiringer-Laman condition. 


The Geiringer-Laman condition tells us why a graph is not stress-free. To 
formulate a (general) reason why a graph is stress-free, we introduce the Henneberg 


construction, |Henneberg 1911] which is an iterative procedure, starting with a 


single node, and increasing the number of nodes of a graph G in one of two ways 


(Figure (15.6): 


(H1) Create a new node and connect it to at most two old nodes. 


(H2) Subdivide an edge and connect the new node to any third node. 


The following important theorem, combining the results of |Henneberg 1911], 
Pollaczek-Geiringer 1927) and |Laman 1970) describes stress-free graphs in the 


plane. 
Theorem 15.11. For a graph G, the following are equivalent: 
(a) G is stress-free in the plane; 


(b) G satisfies the Geiringer-Laman condition; 
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FIGURE 15.6. The Henneberg construction 


(c) G can be obtained by the Henneberg construction. 


Conditions (b) and (c) show that the property of a graph of being (generically) 
stress-free in the plane is in NPNco-NP. The fact that it is in P follows by Corollary 
15.17| below, using algorithms for the decomposition of a graph into connected 


spanning subgraphs. (See also Section}19.3.2}) 


Proof. (a)=(b): This we have argued above. 


(b)=(c): We prove by induction on the number of edges that every graph 
G satisfying the Geiringer-Laman condition can be built up by the Henneberg 
construction. Since the average degree of G is 2m/n < (4n—6)/n < 4, there is a 
node i of degree at most 3. If i has degree at most 2, then we can delete 7 and 
proceed by induction and (H1). So we may assume that 7 has degree 3. Let a,b,c 
be its neighbors. 

Call a set S C V spanning exactly 2|S|—3 edges a tight set. Clearly |S| > 2. 
The key observation is: 


Claim. I[f two tight sets have more than one node in common, then their union is 
also tight. 


Indeed, suppose that 5; and 5 are tight, and let Hy = E(G[S;]). Then £19 Ep 
is the set of edges spanned by $1.S2, and so |, E2| < 2|51,9.$2|—3 (using the 
assumption that |.S1;S2| > 2). The set £,U > is a subset of the set of edges 
spanned by $US , and so the number of edges spanned by $US is at least 


|E, U B5| — |B, |+|E£2|—|Ei 0 £5| > (2|S1|—3) + (2|S2| — 3) — (2|$1.N.$2| —3) 
= 2)5,U S| —3. 


Since G satisfies (GL), we must have equality here, implying that S;US$¥ is tight. 

We want to prove that we can delete + and create a new edge between two of its 
neighbors to get a graph G’ satisfying (GL); then G arises from G’ by Henneberg 
construction (H2). Let us try to add the edge ab to G\i; if the resulting graph 
G’ violates (GL), then there is a set S C V\ {i} with || > 2 spanning more than 
2|S'|—3 edges of G’. Since S spans at most 2|S]—3 edges of G, this implies that S 
is tight, and a,b € S. (If a and b are adjacent in G, then {a, b} is such a tight set.) 
If there are several sets S with this property, then Claim 1 implies that their union 
is also tight. We denote by $,, this union. Similarly we get the tight sets S,, and 
Seas 
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The set S,, cannot contain c, since then adding 7 to Sa», condition (GL) would 
be violated. Similarly, b ¢ Sa. and a ¢ 5}.. In particular, the sets S,,, Spe and Sea 
are distinct, and hence the Claim implies that any two of them have only one node 
in common. Thus their union S = S,pUSpcU Sea U {i} spans at least 


(2| Sav] — 3) + (2 See] — 3) + (2] Seal —3) +3 = 2|$|—3 


edges, and so it is tight. This contradicts the maximality of S,», and completes the 
proof of (b)=(c). 

(c)=(a): We use induction on the number of edges. Let G arise from G’ by 
one step of the Henneberg construction, and let S be a nonzero stress on a generic 
framework (G,u). We want to construct a nonzero stress on (G’,u) (which will be 
a contradiction by induction, and so it will complete the proof). 

First suppose that we get G by step (H1), and let i be the new node. It suffices 
to argue that S is 0 on the new edges, and so it gives a stress on (G’,u). If i has 
degree 0 or 1, then this is trivial. Suppose that 7 has two neighbors a and b. Then 
by the assumption that the representation is generic, we know that the points u,, 
Uu, and uy are not collinear, so from the stress condition 


Sia (u; cs Ug) + Sip (ui; = Us) = 0 
it follows that S;, = Sj, = 0. 
Suppose that G arises by the Henneberg construction (H2), by subdividing the 


edge ab € E(G’) and connecting the new node i to c € V(G’) \ {a,b}. Let us modify 
the representation u as follows: 


: (ug +up) if 7 =4, 
uj; otherwise. 


This representation for G is not generic any more, but it is generic if we restrict it 
to G’. 

If the generic representation u of G admits a nonzero stress, then by Proposition 
[15.9]so does every other representation, in particular u’ admits a nonzero stress S’. 
Consider the stress condition for i: 


Siq(W; — Wa) + Sip (Uy — Up) +.5;-(U; — Ue) = 0. 


Here uj—ul, = $(u,—u,) and uj—ul, = $(u,—u,) are parallel but uj—ul, is not 
parallel to them. So it follows that Si, =0 and Si, = S!,. Defining S’, = S’,, we 


1a? 
get a nonzero stress on (G’,u), a contradiction. 


Using these results, we turn to characterizing rigid graphs in the plane. Equa- 
tion implies that minimally rigid graphs (i.e., rigid graphs that cease to be 
rigid when any edge is deleted) are exactly the maximal graphs satisfying (GL), 
and they all have 2n—3 edges. In other words, 


Corollary 15.12. A graph with n nodes and 2n—3 edges is rigid in the plane if 
and only if it satisfies the Geiringer-Laman condition. 


Minimal stressed graphs (i.e., graphs that are not stress-free but all their proper 
subgraphs are) are also worth considering. 


Corollary 15.13. A graph G is minimal stressed in the plane if and only if it has 
2n—2 edges, but every proper subset S C V spans at most 2|S|—3 edges. 
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The Geiringer-Laman Theorem can be used to prove the following characteri- 
zation of rigid graphs (not just the minimal ones) in the plane. Let us say that G 
satisfies the partition condition, if for every decomposition G = G,U---UGz, into 
the union of graphs with at least one edge, we have 


(15.13) S°(2|V(Gi)|-3) > 2n—3. 


a 


Note that this condition implies that G has at least 2n—3 edges (considering the 
partition into single edges). 


Theorem 15.14. A graph G is rigid in the plane if and only if it satisfies the 
partition condition. 


Proof. To prove that the partition condition is necessary, consider any de- 
composition G = G,U---UG, into the union of graphs with at least one edge. 
Let E’ Cc E be a smallest rigid subset. We know that E’ is stress-free, and 
hence E’NE(G;) is stress-free, which implies that |E’NE(G;)| < 2V(G;)-3 
by Theorem (here we use the easy implication (a)=(b)). It follows that 
2n—3 = |E"| 235, E/N E(Gi)| < O,(21V(Ga)|-3). 

To prove the converse, suppose that G is not rigid, and fix any generic repre- 
sentation u of it. Let H be a maximal spanning subgraph of G satisfying (GL). 
Since H is not rigid, it has fewer than 2n—3 edges. Similarly as in the proof of 
Theorem [15.11] we call a subset S C V tight, if it spans 2|S|—3 edges of H (by 
(GL), S cannot span more edges of H than that). Let S1,...,5, be the maximal 
tight subsets of H. Since every adjacent pair of nodes is tight, the subgraphs H[;] 
cover H. Just like in the proof of T heorem [15.11] it follows that any two subsets 
S; have at most one node in common; in particular, the induced subgraphs G[S;] 
are edge-disjoint. 

For every edge e € £'\ E(H), the graph H +e does not satisfy (GL) any more, 
and so it contains a subset S C V spanning more than 2|.5|—3 edges of H+e. It 
follows that S' is tight, and e is spanned by S. So S C S$; for some i, and e is 
spanned by 5;. This means that G[S)]U...G[S;,] = G; since 


k 


k 
> 5 (215;|-3) = 3 |E(A[S;))| = |E(A)| < 2n—3, 


i=1 


this partition of G violates the partition condition. 


Corollary 15.15. Every minimal stressed graph is rigid in the plane. 


We can derive more graph-theoretic conditions on generic stress-freeness and 
rigidity in the plane by combining the Geiringer-Laman condition and the partition 
condition with classical theorems in and [Nash- Williams 1964]. We 
state these theorems here without proof; a proof will be easy using more general 


results in Section [19.3.2 


Theorem 15.16. Let G be a multigraph on n nodes. (a) The edges of G can be 
covered by k subtrees of G if and only if every subset S C V spans at k(|S'|—1) edges. 
(b) G contains k edge-disjoint spanning trees of G if and only if for every partition 
P of V, the number of edges connecting different classes is at least k(|P|—1). 


Using this theorem, we obtain the following corollary: 
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Corollary 15.17. A graph G is generically stress-free in the plane if an only if 
every graph obtained by doubling an edge of G can be covered by two forests. A graph 
G is generically rigid in the plane if an only if every graph obtained by doubling an 
edge of G contains two edge-disjoint spanning trees. 


We give a simpler necessary condition for rigidity in the plane as an application 


of Theorem ]15.14] |Lovasz—Yemini 1982). 


Corollary 15.18. Every 6-connected graph is rigid in the plane. 


Proof. Suppose that G is 6-connected but not rigid. Then by Theorem}15.14 
there exist subgraphs G; = (Vj, £;) (¢ =1,...,%) such that G = G,U---UG,, and 
k 
(15.14) S>(2|Vi|-3) < 2n-3. 
i=1 
We may assume ee every G; is complete, since adding edges inside a set V; does 
not change (15.14). The graph itself is trivially not complete. We may also assume 
that every nore a to at least two of the V;: deleting a node contained in one 
V; only preserves (15.14) and it is easy to see that it preserves 6-connectivity. 
We claim that 


(15.15) S- (2- a) >2 


Vidvu 
for each node v. Let (say) Vj,...,V,; be those V; containing v, |Vi| < |Va| <... 
The graph is 6-connected, hence v has degree at least 6, and so 
(15.16) de(Mil- 1) 2 6. 

i=1 
Each term on the left side is at least 1/2, so if r > 4, we are done. If r = 3, then 
|V3| > 3, and so the left side of (15.15) is at least 1/2+1/2+1 = 2. Ifr = 2 
and |Vi| > 3, then the left side of (15.15) is at least 1+1 = 2. Finally, if r = 2 
and |V;| = 2, then |V2| > 6 by (15.16), and so the left side of (15.15) is at least 
1/2+3/2 =2. 

Now summing (15.15) over all nodes v, we get 


~ X(2- Wy) = ¥(2- ma) 5 1= Demis 


veEV V;D3u 


on the left side and 2n on the right hand side, which contradicts (15.14). 


It is not enough to assume that G is 5-connected. Indeed, consider any 5- 
connected 5-regular graph H on k > 6 nodes, split each node into 5 nodes of degree 
1, and attach a copy of Ks to these nodes, to get a graph G (Figure 15.7). Clearly 
G is 5-connected. We claim it is not rigid in the plane. Indeed, from each copy 
of Ks we can remove 3 edges so that it remains rigid, and so the remaining graph 
G' = (V, E’) is rigid if G is. But n = 5k, and 


1 
|E"| = 7k+|E(H)| = 10k— 5k < 10k~3 = 2n—3, 


showing that G’ is not rigid in the plane. 
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FIGURE 15.7. Two 5-connected graphs that are not rigid in the plane. 


15.3.2. Generic rigidity in higher dimension. A 3-dimensional analogue 
of the Geiringer-Laman Theorem, or of Theorem is not known. It is not 
even known whether there is a positive integer k such that every k-connected graph 
is rigid in 3-space. One indication why an approach similar to the planar one fails 
can be illustrated by the following example. 


Example 15.19. Consider the graph G obtained by gluing together two copies of 
Ks along an edge ab, and then removing this edge (Figure (15.8). This graph is 
clearly not rigid in 3-space: in every generic representation u we can rotate one 
copy of Ks along the line u,uy while keeping the other fixed. On the other hand, 
its number of edges is 18 = 3-8— (@), which is “just right”, so this graph is stressed. 
But all of its subgraphs H have at most 3|V(H)|—6 edges, and so the existence of 
a stress on (G, u) does not follow by counting dimensions. This graph is a minimal 


stressed graph, showing that Corollary|15.15|/does not remain valid in 3-space. @ 


FicureE 15.8. A nonrigid graph in 3-space. 


Condition (GL) generalizes to any dimension, as a necessary condition for 
stress-freeness: 

(GL-d) Every set SCV with k =|S| >d spans at most dk— eae edges. 

Example |15.19] shows that this condition is not sufficient for d = 3 (and it is 
easy to generalize this counterexample to all d > 3). 

While a complete characterization of stress-free or rigid generic frameworks in 
higher dimension is not available (it may not exist at all), there are a number of 
interesting and highly nontrivial results, which we will discuss in the rest of this 
section. 
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Remark 15.20. Let us add a discussion of the complexity theoretic aspects of 
these characterizations. Violation of the Geiringer-Laman condition (GL) gives 
a simple certificate that a graph is not generically stress-free in the plane. Any 
sequence of Henneberg’s construction building up the graph is a certificate that the 
graph is generically stress-free. So generic stress-freeness is in NPMco-NP. One 
can use basic matroid algorithms to show that generic stress-freeness is in fact in 
P. Similar arguments can be given for generic rigidity of a graph in the plane. 

We noted that stress-freeness of a framework can be easily tested (in polynomial 
time) using elementary linear algebra. This is correct if the coordinates of the points 
are given as rational numbers. For a generic framework, “algebraically independent 
transcendentals” cannot be specified in a way to facilitate computations with them, 
and it is not known whether the problem of deciding whether a graph is generically 
stress-free in dimension 3 (or in any larger dimension) is in P. 

Generic stress-freeness of a graph is a property that is in NP for every dimen- 
sion. One can use the Schwartz—Zippel Lemma [3.12] similarly is in Section [3.3.3] 
to show that a generically stress-free graph has a stress-free realization with ratio- 
nal coordinates whose numerators and denominators have polynomially bounded 
numbers of digits. This realization can serve as certificate, whose validity can be 
easily checked. It is not known, however, how to certify that a generic realization 
is not stress-free; more generally, it is not known whether generic stress-freeness of 
a graph is in co-NP for dimensions 3 or larger. 


15.3.3. Genericity and finite motions. For generic frameworks, there is 


no difference between infinitesimal nonrigidity and flexibility |Asimow—Roth 1978]: 


Theorem 15.21. A generic framework is infinitesimally rigid if and only if it is 
locally rigid. 


Proof. Lemma [15.5] implies the “only if” part. The proof of the “if” part is 
quite similar to the proof of Lemma Suppose that the generic framework 
(G,u) is not infinitesimally rigid in R°®, we want to show that it has a proper 
flexing. The key step is to show that there is a vector labeling v whose entries can 
be expressed as polynomials of the node positions so that v(x) is an infinitesimal 
motion for every vector labeling x in a neighborhood of u, and v(u) is not a rigid 
infinitesimal motion. 

For every minimal set C’ of linearly dependent columns of the rigidity matrix 
Rf = Raw, there is a nonzero infinitesimal motion vc whose support is C. Note 
that vc is uniquely determined up to a scalar factor, and every infinitesimal motion 
is a linear combination of the vector labelings vc belonging to minimal dependent 
sets C of columns. Therefore there is a minimal dependent set C for which vc is 
not a rigid motion. From Cramer’s Rule it follows that the entries of vo can be 
chosen as polynomials w;(u) of the entries of u with integral coefficients. 

The equation (u;—u,)'(w;(u)—w;(u)) = 0 is an algebraic equation in the 
entries of u, and so the assumption that these entries are algebraically independent 
implies that it holds identically in u. In other words, if we define w,;(x) for every 
labeling x : V — R¢@ using the same polynomials as in vc, then and so w(x) is 
an infinitesimal motion of (G,x) for every x. Trivially, v = w(u) is not a rigid 
infinitesimal motion. 

From here, we conclude just as in the proof of Lemma [15.7] 
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15.3.4. Genericity and edge lengths. The following theorem, adopted 


from |Connelly 2005}, connects edge-lengths and stress-freeness. 


Theorem 15.22. (a) A generic framework is stress-free if and only if its edge- 
lengths are algebraically independent. (b) Every framework with algebraically inde- 
pendent edge-lengths is stress-free. 


Proof. (a) Assume that the edge-lengths are algebraically dependent; then 
there is a polynomial F(x, : e € E) with rational coefficients that is not identically 
zero, but 


(15.17) F(..., /uj—uj|?,...) = 0 


(where |u;—u,|* is substituted for the variable x;;, ij € E). Consider such a poly- 
nomial F with minimum degree. Since the coordinates of the u; are algebraically 
independent, holds identically for any choice of representing vectors u. Dif- 
ferentiating with respect to the coordinates of u;, we get the equation 


(15.18) So Fij(--5 |e —ue/?, ..)(ui — uy) = 0 

JEN (i) 
(here F;; is the derivative of F according to the variable z;;). Since F is a 
nonconstant polynomial, at least one derivative Fi; is not identically 0. Then 
F;;(..., [us —uz|?,...) 4 0, since otherwise we could replace F by F;;. But this 
means that Sj; = F;;(...,|Us—ur|,...) defines a nonzero stress on (G,u). 

Conversely, assume that the edge-length are algebraically independent. We 
may also assume that the representation of the graph does not lie on any affine 
hyperplane, since otherwise we can proceed by induction on the dimension. We may 
also assume that we cannot add any further edge so as to preserve this property. 
We can fit the framework to the coordinate system, and thereby reduce the number 
of algebraically independent entries in u to nd— Cr this implies that the number 
of edges is at most nd— ‘ea? 

We claim that the framework is infinitesimally rigid. Suppose not, then by 
Theorem[15.2]]it has a flexing. During this, the distance of at least one nonadjacent 
pair of nodes (i, 7) must change, and since it changes continuously, at some time 7 
and j will have a distance that is algebraically independent of the edge lengths. Let 
us add this edge to create a graph G’, and let v: V — R¢ be the node positions 
at this time. Since we could not add the edge 77 to G in its original position, the 
distance |u;—u,| must be algebraically dependent on the edge-lengths of G. This 
determines an algebraic relation between the entries of u, and since it is generic, this 
algebraic relation must hold for all node positions, in particular when substituting 
v for u, a contradiction. 

Thus the framework is infinitesimally rigid. This implies that the number of 
edges must be exactly dn — Ce, and they cannot carry a stress by (15.11). This 
completes the proof of (a). 


(b) This part of the proof uses deeper tools than those used before. Fix the 
graph G, and consider the squared-edge-length map f : R¢*Y defined by (15.12). 
The framework (G, u) is stress-free if and only if the rank of Df(u) is m. We know 
by (a) that this occurs when u is generic. So the maximum rank of Df(u), over 
all vector labelings u: V — R%, is equal to m. Furthermore, a vector w € R¥ of 
squared edge-lengths is realized by some framework (G, u) carrying a nonzero stress 
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if and only if it is a critical value of the map f. By Sard’s Theorem [Sard 1943], 
the set of such vectors w has measure 0 in R”. 

On the other hand, consider the set of “bad” realizations (critical points of f) 
K = {u: rk(Df(u)) < m}. It is clear that this is a real algebraic variety over the 
rationals: the condition rk(Df(u)) < m is equivalent to saying that det(N) = 0 
for all mx _m subdeterminants N of R. Hence the set {(u, f(u)) : u€ K} is an 
algebraic variety over the rationals as well. By the Tarski-Seidenberg theorem on 
semialgebraic sets, the set f(A’) is semialgebraic over the rationals, i.e., it can be 
described as L,U---ULZ,, where each L; is the solution set of a finite system of 
polynomial equations and strict polynomial inequalities with rational coefficients. 

Now assume that the given linkage (G, ¢) has a realization (G, u) with a nonzero 
stress. This means that there is a w € f(K), where w € R® is defined by w. = 
€(e)?. Since the entries of £ are algebraically independent, so are the entries of 
w, and it follows that they cannot satisfy any nontrivial polynomial equation with 
rational coefficients. So w € L; for some L,; that is defined by strict polynomial 
inequalities. But this means that @ is an interior point of f(), which implies that 
f(K) has positive measure, a contradiction. 


This theorem implies a purely algebraic characterization of generic stress- 
freeness of a graph G. Consider dn variables xj, (¢ = 1,...,n;v = 1,...,d) and 
the polynomials 

d 


Pij = S (tin —- By) (ij € E). 


v=1 


Corollary 15.23. A graph G is generically stress-free in R% if and only if the 
polynomials p;; (ij € E) are algebraically independent. 


Combining parts (a) and (b) of Theorem we see that if a generic frame- 
work is stress-free, then every other realization of the same linkage is stress-free 
as well. An important and surprising generalization of this fact was proved in 
[Connelly 2005]: If we consider a generic framework (G,u) and another (not neces- 
sarily generic) realization (Gv) of the same linkage, then not only are their stress 
spaces simultaneously nonempty, and (as it also follows) not only have they the 
same dimension, but these stress spaces are equal as linear subspaces of R”. (Fig- 
ure shows that this conclusion does not remain valid without the genericity 
assumption.) 


Lemma 15.24. Let (G,u) and (G,v) be two realizations of the same linkage in 
R?, and assume that one of them is generic. Then Str(G,u) = Str(G, v). 


Proof. Let (G,u) be generic, and let (G’, u) be a maximal stress-free subgraph 
of (G,u). Then dim (Str(G’ +e, u)) = 1 for every e € E\ E(G’); let 8° be a nonzero 
stress on (G’+e,u). Trivially, S° is not zero on the edge e; this implies that the 
stresses S° (e € E\ E(G’)) are linearly independent. It also follows that they 
generate Str(G, u): Let S be any stress on (G, u), then there is a linear combination 
>>, GeS® that matches S$ on the edges in E'\ E(G"), and so the stress S— >, a-S° 
is supported on E(G"’). Since (G’, u) is stress-free, it follows that S = }>, a.S®. 

Our next goal is to show that every S¢ is a stress on (G’ +e, v) as well. Theorem 
[15.22]implies that the edge lengths of (G’ +e, u) are algebraically dependent. Using 
this, the stress S© can be constructed on (G’ +e, u) just like in the proof of Theorem 
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VAY & 


FIGURE 15.9. (a)-(b) Two realizations of the same linkage in the 
plane, only one of which has a stress: (a) is stress-free; (b) has an 
infinitesimal motion as shown, and therefore it carries a stress. (c)— 
(d) Two realizations of the same linkage in the plane with different 
stress spaces. The graph K43 has 7 nodes and 12 > 2:7—3 edges, 
and so every realization in the plane has a stress. The values of 
any stress on the three edges incident with the black node must 
have the same sign in (d), but not in (c). 


15.22(a), using (15.18). Note, however, that (15.18) is an identity, and so it holds 


true when we replace the coordinates of u by the coordinates of v: 


S- F;;(..., [Vs — Vel’, ---)(vi— Vy) = 0. 


j: ij €E(G' +e) 


But v realizes the same linkage as u, and so |v; —v,| = |u; —u,| for every edge ij of 
G. So this equation gives the same nonzero stress S° on (G’ +e, v) as on (G’ +e, u). 
As mentioned above, the framework (G’,v) is stress-free, so this is a unique stress 
on (G’+e,v). By the same argument as above, the stresses S* generate Str(G, v), 
which proves the lemma. 


15.4. The realization space 


What can we say about the space of all realizations of a linkage, at least in 
the generic case? Sharpening results of [Hendrickson 1992], the following theorem 
asserts that the realization space of a generic framework is 
a smooth manifold. To see that this statement is far from obvious, let us have a 
look at the simple example of a parallelogram in the plane with a triangle attached 
(Figure }15.10). To factor out congruences, let us nail down the vertices of the 
bottom triangle. (The space of all realizations could be obtained as the product 
of this space with R? x $1Z», taking into account the isometric transformations.) 
In a typical position, the parallelogram can be deformed in two directions. But in 
the special position when it is collinear, there are four possible directions we can 
deform it. So the realization space is not a manifold, but the union of four arcs 
connecting the two collinear realizations. 


Theorem 15.25. The space of realizations of a linkage defined by a generic frame- 
work (G,u) in R¢ is a smooth manifold of dimension dim(Infgu). 


Proof. Let t = rk(R@u). Since (G,u) is generic, we have rk(Rev) < 
rk(Reu) = t for every vector labeling v. By Lemma/]15.24| and (15.11), we have 
rk(Rg,y) = t for every vector labeling v realizing the linkage (G, @,). The matrix 
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FIGURE 15.10. Motions of a parallelogram with a triangle at- 


tached do not form a manifold. The triangle is fixed; this factors 
out isometries. 


Rew is the Jacobian of the map f(v) = (|vi—v,;|?: ij € E), and so it has maxi- 
mum rank t for every v € f~+(€,). By basic differential geometry, this implies that 
f-‘(éu) is a smooth manifold of dimension nd-t. 


Theorem [15.25] is not quite what we would like to see: we don’t want to distin- 
guish between congruent frameworks. Define a reduced realization as an equivalence 
class of congruent realizations. Is the space of reduced realizations (endowed with 
a metric in the natural way) still a manifold? We must be careful, as shown by the 
following example. 


Example 15.26. Consider a path of length 2 with nodes a,b,c (in this order), 
where the two edges have algebraically independent lengths a@ and 8. Any 2- 
dimensional realization of this linkage can be encoded by specifying the position of 
b, and the two angles formed by the edges with the positive axis. So the realization 
space is R? x $1 x $1, a nice 4-dimensional manifold. 

If we identify congruent realizations, then we can uniquely describe a reduced 
realization u by the distance of ug and uc, which fill the interval [|a—|,a+ 6]. 
So the reduced realization space is homeomorphic to a segment, which is not a 
manifold. Notice that the trouble (violation of the manifold property) is caused by 
those realizations that are collinear. 4 


Theorem 15.27. The space of reduced full-dimensional realizations of a linkage 
defined by a generic framework (G,u) in R® is a differentiable manifold of dimen- 


sion dim(Inf¢.u) — ow . 


The linkage may or may not have realizations in a lower dimension, but if it 
does, these are excluded from consideration. 


Proof. The proof combines the proof method of Theorem[15.25] with standard 
techniques in differential geometry, and we give a sketch only. Select an ordered 
set S = (so,..., Sa) of d+1 nodes, and consider all vector labelings in which these 
nodes are affine independent. We can fit S to the coordinate system; this selects a 
particular labeling from each equivalence class of congruent labelings. The set of all 
vector labelings with S' fitted to the coordinate system is described by dn— (Ae) 
variables, d of which are restricted to the positive semiline. 
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By the same argument as above, the realizations v of (Gu) in which v(S) is 
affinely independent is a t-dimensional manifold Mg. For realizations in which both 
of two (d+1)-sets of nodes S$ and S’ are labeled by affinely independent vectors, 
congruent realizations in the manifolds Ms and Ms can be identified, and the 
identification maps are smooth. So the manifolds Mg form an atlas of charts, 
endowing the reduced realization space with a differentiable structure. 


Since the space of all realizations is trivially compact, we get a stronger state- 
ment if every realization is full-dimensional. 


Corollary 15.28. If the linkage defined by a generic framework (G,u) in R?@ has 
no realization in R¢—1, then the space of all reduced realizations in R¢ is a compact 


differentiable manifold of dimension dim(Infg¢.u) — car 


15.5. Global rigidity 


Let us recall that a framework in R? is globally rigid, if every other realization 
of the same linkage in R® is congruent to it. In other words, the distances between 
adjacent nodes determine the distances between nonadjacent nodes. 

There are a large number of applied problems which can be modeled using the 
concept of global rigidity rather than local or infinitesimal rigidity. As an example, 
suppose we have a number of sensors scattered in the plane or in space. These 
could be ships on the ocean, or animals with embedded chips, or perhaps, in the 
future, diagnostic nano-devices in the body. We are able to measure the distance 
between some pairs (say, by the delay of radio signals), and would like reconstruct 
their geometric positions. This can be done if and only if the framework of the 
devices as nodes and the measured distances as edges is globally rigid. 

One expects that global rigidity is more difficult to handle than local rigidity, 
since two different realizations of the same linkage may not be continuously movable 
into each other, and so analytic tools like differentiation are not available here. This 
is indeed true, but there is a surprising variety of nontrivial results on global rigidity, 
of which we give a sampler. 

Let us warm up with the 1-dimensional case. Trivially, a disconnected graph 
is not rigid on the line in any sense, and it is not hard to see that every connected 
graph is locally and infinitesimally rigid with any representation. Global rigidity 
is more complicated. Our first observation is that if G has a cutnode a, and so we 
can write G = G;UG»2 where V(Gi)NV(G2) = {a}, |V(G;)| > 1, then from any 
representation (G,u) we can create another representation (G, u’) by reflecting the 
nodes of G, in ug. This shows that no representation can be globally rigid unless 
either all nodes of G; or all nodes of G2 are represented by the same point. 

Assume that G is 2-connected. Even if the nodes are represented by different 
points, global rigidity does not follow. 


Example 15.29. Consider a quadrilateral on nodes V(C4) = {a,},c,d} (in this 
cyclic order), and its 1-dimensional representation ug = 0,uy = 1,ue = 3,Uq = 2 
(Figure (15.11). Since Cy is connected, this representation is locally and infinites- 
imally rigid. Mapping the nodes to vg 2, Ub 3, Ue 1, va 0, we get an- 
other realization of the same linkage, which is not congruent to the first one, since 
|Ua —Ue| = 3 but |va—v_e| = 1. Thus (G,u) is not globally rigid on the line. It is 
easy to construct frameworks on the linea that are arbitrarily highly connected but 


not globally rigid (Exercise |15.13). ¢ 
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FIGURE 15.11. Even a 2-connected framework may not be globally 
rigid on the line. The pictures are vertically distorted to make the 
structure visible. 


However, if G is 2-connected and the labeling u: V — R is generic, then the 
framework will be globally rigid on the line. The proof of this fact is left to the 
reader as Exercise [15.12] 

One might expect that if a linkage is not globally rigid, i-e., it has two incon- 
gruent realizations, then one can obtain a second realization by slightly deforming 
the first one; in other words, the framework is flexible. This is, however, not the 
case, as 1-dimensional representations of any graph with a cutnode show. (It is 
easy to generalize this example to higher dimensions.) 

Which are the globally rigid representations of a graph? In the case of infinites- 
imal rigidity, generic representations (vector labelings) are the “best”, in the sense 
that if any representation is infinitesimally rigid, then every generic representation 
is rigid as well. Nothing like this holds for global rigidity: for example, labeling each 
node of a connected graph with the same vector gives a globally (even universally) 
rigid representation, but of course the graph need not be globally rigid (or even 
locally rigid) in a generic representation. We have seen examples of globally rigid 
but not infinitesimally rigid graphs in Figures and Figure [15.12 shows 
that the 3-cube graph is not globally rigid in generic position in the plane, but it 
has a nontrivial realization that is globally rigid. 


(a) (b) (c) (d) 


FIGURE 15.12. Structure (a) is not globally rigid in the plane 
(flip one triangle), but adding an edge (b) we get a globally rigid 
framework. (c) shows a generic representation of the cube graph, 
which is not infinitesimally rigid (just count degrees of freedom), 
and hence it is flexible and so not globally rigid. Representation 
(d) of the cube is globally rigid; see Exercise [15.16] 


So to make the question “Does G have a globally rigid representation?” mean- 
ingful, we have to impose some sort of nondegeneracy condition. Genericity is 
perhaps the most natural, but general position will be another interesting case. 


15.5.1. Global rigidity up to an affine transformation. Before starting 
with the analysis of global rigidity, let us discuss a weaker version of it. We say 
that a framework (G,u) in R?@ is globally rigid up to an affine transformation, if 
for every vector labeling v: V — R? realizing the same linkage, there is an affine 
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transformation A of the space such that v; = Au;. Note that the definition of 
global rigidity only differs in that it requires A to be an isometry. 


Example 15.30. Figure shows a rather trivial example in the plane. Note, 
however, that a noncongruent affine transformation in the plane preserves the length 
of segments in at most two directions. So all examples in the plane are similar to 
this one. 4 


FIGURE 15.13. A planar framework whose edge lengths are pre- 
served by a noncongruent affine transformation. The edges are 
curved in order to show the structure. 


One feels that rigidity up to affine transformations is quite close to global 
rigidity, but it is not so easy to handle the connection. We start with a rather 
simple characterization [Connelly 1982]. Let us say that in a framework (G,u) in 
R¢@ the edge directions lie on a conic at infinity, if there is a symmetric nonzero 
matrix Q € R@*¢ such that (u;—u,;)'Q(u;—u,j) = 0 for every edge ij € E. We 
say that this conic is defined by Q. 


Lemma 15.31. A framework (G,u) has a noncongruent affine image that preserves 
the edge lengths if and only if the edge directions lie on a conic at infinity. 


Another way of interpreting this condition is that there is a noncongruent infini- 
tesimal motion where the velocities Qu; depend linearly on the positions u;. 


Proof. Suppose that (G,v) is an affine image of (G,u), realizing the same 
linkage, but not congruent to (G,u). In other words, there is an affine transforma- 
tion A of R@ such that v; = Au, for all nodes i, but A is not a congruence. We 
may assume (by translation) that A is a homogeneous linear transformation. Since 
A preserves the lengths of edges, we have |A(u;—u,)| = |u;—u,| for every edge 
ij, which can be written as (u;—u,)'(A'A—J)(uj;—u,;) = 0. Since A is not an 
orthogonal matrix, Q = A'A—I 4 0 defines the conic at infinity. 

The converse follows by a similar argument. 


The matrix Q, defining the conic at infinity, has some further useful properties. 
Let S be any stress on (G,u). For every node i, we have 


> Sis(ui— ay) ™QCas— uy) = 0, 
j 


since every term is zero. Expanding, we have 


62 Sis) uj! Qu; _ 2u/Q 63 Siu) +50 SigujQu;. 
J J J 
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The first two terms are zero as S is a stress, so the last term must be zero as well: 


(15.19) > SijujQu; = 0. 
jEeVv 


Using a similar expansion, we get that 


(15.20) S > Si; (uj — ug) Q(uj — uy) = 0 


for any three nodes i, k and J. Also note that if u is full-dimensional, then (u;— 
u;)'Q(u;—u,;) 4 0 for at least one nonadjacent pair i,j € V (this is left to the 
reader as an exercise). 

The following lemma gives some simpler conditions under which affine images 
that preserve the lengths of edges are automatically congruent images. The first 
condition was formulated in [Connelly 2005], the second in [Alfakih-Ye 2013]. As 
a motivation for condition (b) below, let us recall (14.4): if (G,u) spans R¢, then 
every stress on it has corank at least d+1. 


Lemma 15.32. Let (G,u) be a framework in R%, and suppose that one of the 
following conditions hold: 


(a) (G,u) is generic and all nodes have degree at least d. 
(b) (G,u) ts in general position and carries a stress of corank d+1. 
Then the edge directions do not lie on a conic at infinity. 


Proof. Suppose that the edge directions do lie on a conic at infinity, defined 
by Q. Since the conditions are invariant under translation, we may assume that 
u, = 0 for an appropriately chosen node a. 


(a) We have a system of homogeneous linear equations for the entries of Q, 
and hence the existence of such a matrix Q is equivalent to the vanishing of certain 
determinants composed of quadratic polynomials of the vectors u,;. If these deter- 
minants vanish for the generic vector labeling u, then they vanish for every vector 
labeling in R¢. So for every other vector labeling v, there is a symmetric nonzero 
matrix M € R?*¢ such that (v;—vj)"'M(vi—v,) = 0 for every edge ij € E. 

Let vi; = (a) be obtained by replacing the first entry of u; by 1 for i > 1, and 
let v; = 0. Consider the matrix M as above, and let M’ be obtained from M by 
deleting its first row and column. For every edge ij with 7,7 > 1, we have 


(uj, —u,)'M’(uj—u) = (vi-vj) "M(vi—vj) = 0. 


Since the graph G\1 has degrees at least d—1, and u’ is a generic vector la- 
beling of it in dimension d—1, it follows by induction that M’ = 0. This 
means that we can write M = be]+e{b with some nonzero vector b, and so 
v) Mv; = 2(b'v;)(elv;) = 2(b'v;). For i € N(1), we have 

v) Mv; = (vi—v1)'M(vi—v1) = 0, 
and hence b'v; = 0. Since any d of the vectors v; (i € N(1)) are linearly indepen- 
dent, this is a contradiction. 


(b) Let S be a stress matrix with corank d+1. First, note that S cannot have 
an all-zero row; indeed, deleting such a row (say, row i) and the corresponding 
column, we would get a stress matrix on (G\i,u) of corank d, while the vectors u, 
(j #2) would still span the affine d-space; this would contradict (14.4). 
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It follows that for every node i, the equation >> j Sij(uj —uz) = 0 provides 
a linear dependence between the vectors u;—u; (j € N(t)), and hence (by the 
assumption that the representation is in affinely general position) there must be 
at least d+ 1 terms with nonzero coefficients. Hence the degree of each node is at 
least d+1. In particular, the vectors uj;—u; (j € N(i)) must span R¢. 

Now suppose that the edge directions lie on a conic at infinity defined by the 
symmetric nonzero matrix Q. Define q € RY by gq; = u/Qu;, then we have Sq = 0 
by (15.19). By assumption that S$ has corank d+1, the vector 1 and the columns 
of U' generate the kernel of S, which means that there is a vector b € R@ and a 
real number a such that g; = uJQu; = b'u;+ a for every i € V. 

Substituting these expressions, we get for every 7 and 9 


(u; +u;)'Q(u;—u,) = b'(u;—uj). 
Adding the equation (u;—u;)'Q(u;—u,;) = 0 (valid for ij € E), we get 
2u/Q(u;—u;) = b'(u;—u,). 
Since the vectors u,—u; (7 fixed, 7 € N(i)) span the space, it follows that 2Qu; = b 


for every i € V. But then the representation lies in a proper affine subspace, a 
contradiction. 


For more on frameworks whose edge directions lie on a conic at infinity, 
including connections with the Strong Arnold Property and with the comple- 
tion problem of semidefinite matrices, see and 
[Laurent—Varvitsiotis 2014]. 

The following lemma provides a way to prove that only affine (but not neces- 
sarily congruent) images preserve the lengths of edges. 


Lemma 15.33. Letu: V > R¢ andv: V — R®¢ be two vector labelings of the 
same graph G, and suppose that (G,u) and (G,v) carry the same stress S with 
corank d+1. Also suppose that (G,u) spans R¢?. Then v is an affine image of u. 


Proof. Consider a third labeling w; = (y') € R*4. Clearly S remains a stress 
on this framework too, so by (14.4), the effective dimension of (G, w) is at most d, 
and so there is a d-dimensional affine subspace L C R?¢ containing all of the vectors 
w;. The projection 7, of LZ to the first d coordinates must be bijective. Let m2 be 
the projection of L to the last d coordinates. It follows that T, 12 is an affine map 


from (G,u) onto (G,v). 


Theorem 15.34. Assume that a framework (G,u) spanning R¢ carries a stress S 
whose matrix is negative semidefinite and has corank d+1. Then (G,u) is globally 
rigid up to an affine transformation. 


Of course, assuming that S is positive semidefinite would lead to the same 
conclusion: just replace S by —S. 


Proof. Let (G,v) be another realization of the framework (G,u) in R?. Our 
goal is to show that S is a stress on (G, v) as well; then Lemma will complete 
the proof. Since S is a stress, the energy satisfies E(u) = 0 by (14.5), and since 
all edge-lengths are preserved, we have E(v) = E(u) = 0. Since S is negative 
semidefinite, the labeling v minimizes the energy, and so $ must be a stress on the 
framework (G,v) as well. By Lemma [15.33] this implies that v is an affine image 
of u. 
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15.5.2. Global rigidity in general position. The following condition for 
a graph to have globally rigid vector labelings in general position was proved in 


Alfakih 2017): 


Theorem 15.35. A graph has a globally rigid representation in R% in affinely 
general position if and only if it is either complete or (d+1)-connected. 


Proof. The case of complete graphs is trivial, so suppose that the graph G is 
not complete. First, suppose that G has a cutset T with |T| < d. Consider any 
general position vector labeling u of V, and let H be any hyperplane containing 
the vectors representing T but no other vector label (by the general position of the 
labeling, we can choose such a hyperplane H). Reflecting one of the components of 
G\T in H, we get another representation in which the edge lengths are the same 
as before, but which is not congruent. So (G,u) is not globally rigid. 

To prove the converse, let G be a (d+1)-connected graph. By Theorem 
it has an orthogonal representation in general position in R”~¢~!. Considering the 
Gram matrix of these vectors, we get a positive semidefinite G-matrix S of rank 
n—d—1 such that every (n—d—1) x (n—d—1) submatrix of S is nonsingular. 

Next, we construct a vector v € Ker(S) with nonzero entries. It suffices to 
show that for every 7 € V there is a vector w € Ker(S) with w; 4 0; a “generic” 
linear combination of such vectors is a vector in Ker(S) with all coordinates differ- 
ent from 0. If no such vector w exits, then elementary linear algebra shows that 
Sa = e; for some vector a € RY. S is positive semidefinite, hence it can be written 
as S = X'X for some X € R("-4-D*XV, where any n—d—1 columns of X are lin- 
early independent. Let Y be any (n—d—1) x (n—d—1) submatrix not containing 
column i, then e; = X'Xa implies that 0 = Y'Xa. Since Y is nonsingular, this 
implies that Xa = 0, whence Sa = X'Xa = 0, a contradiction. 

Scaling the rows and columns of S if necessary, we may assume that v = 1, so 
S1 = 0. We can extend 1 by d further column vectors in R” to a basis of Ker(S), 
and combine these to a single matrix W such that SW = 0. The columns of WT 
have the form wi = (,.) (i € V, ui € R*), and so SW = 0 means that S is a stress 
matrix on the vector labeling u of G. So (G, u) carries a positive semidefinite stress 
of rank n—d—1, and hence it is globally rigid up to an affine transformation. 

If (G, u) is not globally rigid, then its edge directions lie on a conic at infinity, 


defined by a matrix Q € R2?%¢. By (15.20), 
S > Six (uy — ug) Q(uj — ux) = 0 
J 


for every i and k. Choose k so that (u;—uy,)'Q(uj—ux) 4 0 for at least one j, 
then the columns of S corresponding j € N(k) are linearly dependent. Since the 
number of these entries is at most n—d—1, this contradicts the choice of S. Thus 
(G, u) is globally rigid. 

What remains is to argue that any d+ 1 of the vectors u, are affine independent, 
or equivalently, any d+1 of the vectors w; are linearly independent. Suppose not, 
then there is a nonzero linear combination b = Wa of the columns of W that has 
d+1 zeroes; but then Sb = SWa = 0, which shows that S has n—d—1 linearly 
dependent columns, a contradiction. 


15.5.3. Generic global rigidity. We say that a graph is generically globally 
rigid, if it is globally rigid for every generic vector labeling. It turns out that it 
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would suffice to require this for a single generic vector labeling; this is a nontrivial 
fact |Gortler-Healy—Thurston 2010), whose proof we do not reproduce here. Figure 
15.14) illustrates some of the difficulties. 


(a) (b) (c) 


FIGURE 15.14. Three generic frameworks in the plane with the 
same graph; (a) and (b) realize the same linkage. Three more 
frameworks are obtained by flipping the black node over the line 
connecting its neighbors, so neither one of these is globally rigid. 
(c) cannot be realized so that the two triangles do not overlap. So 
(a) has four noncongruent realizations, (c) has only two. 


In the case of generic frameworks, one can give a full characterization of 
global rigidity in terms of the existence of appropriate stresses. The “if” part 


of the following theorem was proved in {Connelly 2005), the “only if” part in 
Gortler—-Healy—Thurston 2010]. The proof of the “only if” part, which extends 


the methods of the proof of Hendrickson’s Theorem below, will not be reproduced 
here. 


Theorem 15.36. A generic framework (G,u) in R¢ with at least d+2 nodes is 
globally rigid if and only if it carries a stress S whose matrix has corank d+1. 


Proof. (Sufficiency.) Again, we consider another realization (G,v) of the 
framework (G,u) in R?. By Lemma|15.24| S is a stress on the framework (G, v) as 
well, and so by Lemma (G,v) is an affine image of (G,u). Lemma b) 
completes the proof. 


While this theorem gives an interesting and nontrivial condition for generic 
global rigidity, this condition is not easy to verify. It is related to two simpler 
algorithmic problems that are notorious for their unknown complexity: as we have 
seen, no polynomial time algorithm is known to decide whether a generic frame- 
work carries a stress, and no polynomial time algorithm is known to determine the 
maximum rank in a linear space of matrices (in this case, in the space of stresses). 
However, the methods developed for these two problems do yield a randomized 
polynomial time algorithm to determine whether a graph is globally rigid in R@ 
(see Exercise |15.15). 

From these considerations, it follows that more combinatorial necessary and/or 
sufficient conditions for generic global rigidity are valuable. There is a nontriv- 
ial necessary condition for global rigidity [Hendrickson 1992], which reveals that 
generic global rigidity is related to generic (local) rigidity in interesting ways. 
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Theorem 15.37. If a graph G on at least d+2 nodes is generically globally rigid 
in R¢, then it is (d+1)-connected and deleting any edge it remains generically rigid 
in Re. 


Proof. The first condition follows from (the easy part of) Theorem 

To prove the second, suppose that G’ = G'\e is not rigid for some edge e = 
ab € E. This means that for some (or any) generic labeling u in R?, the framework 
(G’,u) is not infinitesimally rigid. Since (G,u) is infinitesimally rigid, it follows 
that rk(Re,u) = dn— (“") and rk(Reru) = dn—- ("4") -1. 

Every realization of the linkage (G’,£,) spans R¢. Indeed, let (G”,u) be a 
maximal stress-free subgraph of (G’,u). Then, using that n > d+1, 


n(G)| > Ht Read) =a Re) San oo A tie @) 


and hence every (d—1)-dimensional realization of (G”,¢,) would have to carry a 
stress, contradicting Lemma[I5.24| 

By Corollary |15.28| the space of reduced realizations of the linkage (G’, Cy) is 
a compact manifold of dimension 1, i.e., it consists of one or more closed Jordan 
curves. Let J be the curve through the congruence class of u. 

The function f(x) = |x,—x»|? is invariant under congruences, and hence it 
can be viewed as a (smooth) function on J. The vector labeling u cannot be a 
stationary point of f: this would give an algebraic equation between the coordinates 
of u, which would be identically true, and hence f would be a constant, and moving 
along J we would get a flexing of the whole graph G (not only of G’). Hence u 
is neither the maximizing nor the minimizing argument of f, and hence there is 
another realization v along J such that f(u) = f(v). This means that (G,v) 
is a noncongruent realization of the linkage (G,¢y), and so (G,u) is not globally 
rigid. 


The converse of Theorem is not valid [Connelly 2005]: K5,5 is not globally 
rigid in R°, but it meets the necessary conditions in Hendrickson’s Theorem. On 
the other hand, these conditions are sufficient in the plane : 
In fact, Jackson and Jordan prove the following result, quite analogous to Theorem 
(for the proof, the reader is referred to the original paper.) 


Theorem 15.38. For a graph G on at least 4 nodes, the following are equivalent: 
(a) G is globally rigid in the plane; 
(b) G is 3-connected and deleting any edge, it remains rigid in the plane; 


(c) G can be obtained from K4 by a succession of the second Henneberg con- 
structions (H2) and edge additions. 


Exercise 15.1. Prove that a framework (G, u) is flexible on the line if and only 
if G is disconnected. 


Exercise 15.2. Prove that in the plane i +> e1, i + e2 andi+> Ru; (where 
R denotes rotation by 90°) are infinitesimal congruences, which generate Rig. In 
3-space, 7+ e€1, 1H €2,1H €3,7 e1 X Ui, 7+ e2 X U; and? +> e3 X u; generate 
Rig. For which frameworks are these generators linearly independent? 
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Exercise 15.3. (a) Show that (15.5) defines an infinitesimal motion. (b) Let U(t) 
(t € [0,1]) be a smooth family of congruences in R*. Prove that (d/dt)(0)u = 
w+ Au, where w € R? and A is a skew symmetric matrix. Conversely, every 
affine transformation u ++ w+ Au, where w € R®@ and A is a skew symmetric 
matrix, arises from a smooth family of congruences. 


Exercise 15.4. Let u be a representation of G in R¢. Prove that a vector T € R® 
is in the orthogonal complement of Str(G, u) if and only if there are vectors v; € R@ 
such that Ti; = (vi—vj)'(ui—u;) for every edge ij. 

Exercise 15.5. Let (G,u) be a rigid framework that has at least one cable or 
strut. Prove that it has a nonzero stress. 

Exercise 15.6. Let G be the octahedron graph, where V = {1,2,3,4,5,6} and 
E = {12,34,56}. Consider the following vector labeling in R*: let ui, uz, us, u4 
be points of the unit circle (in this cyclic order) of the plane z = 0 so that 
4£(ug,u3) = £(u4,u,). Let us and ug be arbitrary points on the z-axis. Prove 
that (G, u) is flexible. 


Exercise 15.7. Let u,v',v”..., be noncongruent realizations of the same linkage 
(G, £), and suppose that v? — u; for all nodes. Prove that (Gu) has a nontrivial 
flexing. 

Exercise 15.8. Let (G,u) be a framework and v, an infinitesimal motion. Prove 
that the frameworks (G,u+v) and (G,u—v) define the same linkage (have the 
same edge lengths). 


Exercise 15.9. Let (G,u) be a generic framework with a stress S. Prove that 
there is a nonzero polynomial F' in m variables x:; (ij € E) such that Si; = 
Fy; (. a oy Jus — ul’, Sates ). 

Exercise 15.10. Is the configuration space of the Peaucellier—Lipkin linkage (Ex- 
ample[15.1) a closed manifold? 


Exercise 15.11. Prove that if linkage with algebraically independent edge lengths 
has a realization, then it has a generic realization. 


Exercise 15.12. . Prove that a graph G is generically globally rigid on the line 
if and only if it is 2-connected. 


Exercise 15.13. For every k > 1, construct a k-connected framework on the line 
that is not globally rigid. 


Exercise 15.14. Prove that if a framework (G, u) is globally rigid up to an affine 


transformation, but not globally rigid, then there is a nonzero matrix Y € RV*Y 
such that Yi; = 0 for ij € HUA, and YS = 0 for every stress on (G, u). 


Exercise 15.15. Prove that generic global rigidity of a graph in R@ can be decided 
in randomized polynomial time. 


Exercise 15.16. Prove that the framework in Figure|15.12{(d) is globally rigid in 
the plane. 


CHAPTER 16 


The Colin de Verdiére Number 


Let G be a connected simple graph. We know that (by the Perron—Frobenius 
Theorem) the largest eigenvalue of the adjacency matrix has multiplicity 1. What 
can we say about the multiplicity of the second largest? 

The eigenvalues of most graphs are all different, so the multiplicity of any of 
them gives any information about the graph in rare cases only. Therefore, it makes 
sense to try to maximize the multiplicity of the second largest eigenvalue by weight- 
ing the edges by positive numbers. The diagonal entries of the adjacency matrix do 
not carry any information, so we allow putting there arbitrary real numbers. The 
off-diagonal matrix entries that correspond to nonadjacent pairs of nodes remain 0. 

There is a technical restriction, transversality (Strong Arnold Property as de- 
fined in Section [10.5), to exclude choices of edgeweights and diagonal entries that 
are too degenerate. 

Since we do not put any restriction on the diagonal entries, we may add a 
constant to the diagonal entries to shift the spectrum so that the second largest 
eigenvalue becomes 0, without changing its multiplicity. The multiplicity in ques- 
tion is then the corank of the matrix (the dimension of its nullspace). 

Finally, we multiply the matrix by —1, to follow convention. Recall that we have 
met such matrices in Chapter canonical braced stress matrices of 3-polytopes 
have all these properties (cf. Proposition [14.14). 

This multiplicity, as a graph parameter, was introduced and first stud- 
ied in {Colin de Verdiére 1986] and [Colin de Verdiére 1991]; for a survey, see 
van der Holst et_al. 1999]. Its most interesting feature is that it is related to 
topological properties of the graph: for example, its value is at most 3 if and only 
if the graph is planar. For this book, its connection to geometric representations 
makes it particularly interesting. For example, we are going to show that for 3- 
connected planar graphs, the optimizing matrices in the definition above correspond 
to Steinitz representations (representations as skeletons of convex 3-polytopes). 


16.1. Basics 


We continue with the formal definition. Recall that the a G-matrix A is 
transversal, or has the Strong Arnold Property, if AX # 0 for every nonzero G- 
matrix X with zero diagonal. 

A well-signed G-matrix with exactly one negative eigenvalue and with the 
Strong Arnold Property will be called a Colin de Verdiére matrix of the graph 
G. The Colin de Verdiére number ~(G) of the graph G is defined as the maximum 
corank of any Colin de Verdiére matrix of G. 

We can formulate this problem as a minimum dimension problem for orthogonal 
representations—not in Euclidean space, but in Minkowski space. To define this 
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space, we equip the linear space R” with an indefinite inner product 
(x, y) = 7141 eres + 2n-1Yn-1 —ZXnYn- 


In other words, with the matrix 


«fay 0 
ando= (MS), 


we have (x,y) =x!Ay. Using this formalism, every symmetric matrix A € R with 
one negative eigenvalue can be written as a “Gram matrix”: A;; = (u;,u,) with 
appropriate vectors u; € R?%, where d is the rank. This explains the analogies in 
our treatments here and in Section[10.5} but the indefiniteness of this inner product 
causes difficulties to be taken into account. 


Example 16.1. It is clear that j4(/1) = 0. We have p(G) > 0 for every graph with 
at least two nodes. Indeed, we can put “generic” numbers in the diagonal of —Ag 
to make all the eigenvalues different, and then we can subtract the second small- 
est eigenvalue from all diagonal entries to get one negative and one 0 eigenvalue. 
Transversality, as noted in Section [10.5.2 is automatic for matrices with corank 1. 

This discussion implies the useful fact that if G has at least two nodes, then it 
is sufficient to require that M has at most one negative eigenvalue. Indeed, if M is 
positive semidefinite, then 0 is its smallest eigenvalue, and so the Perron—Frobenius 
Theorem implies that it has multiplicity 1. So the only case when dropping the 
condition that M has a negative eigenvalue would allow larger corank is when 
u(G) = 0, ie, when G= ky. 4 


Example 16.2. For a complete graph K, with n > 1 nodes, it is easy to guess 
—J as a Colin de Verdiére matrix. It is trivial that —J is a well-signed G-matrix 
with one negative eigenvalue. Transversality is trivial again. The corank of —J 
is n—1, and one cannot beat this, since at least one eigenvalue must be negative. 
Thus p(K,) =n-1. ¢ 


Example 16.3. Next, let us consider the graph K,, consisting of n > 2 isolated 
nodes. All entries of a Colin de Verdiére matrix M, except for the entries in the 
diagonal, must be 0. We must have exactly one negative entry in the diagonal. 
Trying to minimize the rank, we would like to put 0’s in the rest of the diagonal, 
getting corank n—1. But this is where the Strong Arnold Property enters: we can 
put at most one 0 in the diagonal! In fact, assuming that (say) M11 = M22 = 0, 
the matrix X with 
‘ if {7,7} = {1, 2}, 

Xij= . 
0, otherwise 


violates the Strong Arnold Property. (We could have appealed to Lemma|10.26}) 
So we must put n—2 positive numbers in the diagonal, and we are left with a single 


0. It is easy to check that this matrix will satisfy (M3), and hence p(K,)=1. 


Example 16.4. Consider the path P, on n > 2 nodes, labeled 1,2,...,n in their 
order on the path. Consider any Colin de Verdiére matrix M for P,,, and delete 
its first column and last row. The remaining matrix has negative numbers in the 
diagonal and 0’s above the diagonal, and hence it is nonsingular. Thus the corank 
of M is at most 1. We have seen that a corank of 1 can always be achieved. Thus 
(Py) = 1. 4 
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Example 16.5. Consider a complete bipartite graph Kp, where p > q > 2. In 
analogy with K,,, one can try to guess a Colin de Verdiére matrix with low rank. 


The natural guess is 
0 —-J 
v=(r v): 


where J is the pxq all-1 matrix. This clearly belongs to Mk, 1 and has rank 2, 
which seems to imply p+q—2 for u(K,,). But it turns out that this matrix does 
not have the Strong Arnold Property unless p,q < 3. In fact, consider a matrix 


Y 0 
x= (0 2) 


where Y is a pXp symmetric matrix with 0’s in its diagonal and Z is a qxq 
symmetric matrix with 0’s in its diagonal. The condition MX = 0 is then equivalent 
to requiring that Y and Z have 0 row-sums. It is easy to see that this implies that 
Y=Z=Oif p,q < 3, but not if p> 4. 

This establishes that u(Kp,4) > p+q—2 if p,q < 3, and equality holds here 
(Exercise|16.2); but if p > 4, then our guess for the matrix M realizing p4(Kp,q) does 
not work. We will see that in this case yu will be smaller (equal to min{p, g}+1, in 
fact). 4 


Remark 16.6. There is a quite surprising fact in this last Example, which also 
underscores some of the difficulties associated with the study of jz. The graph K4,4 
(say) has a node-transitive and edge-transitive automorphism group, and so one 
would expect that at least some optimizing matrix in the definition of 1 will share 
these symmetries: it will have the same diagonal entries and the same nonzero off- 
diagonal entries. We have seen that an analogous statement is valid for orthogonal 
representations in the definition of the theta-function (Theorem (11.5). But here 
this is not the case: it is easy to see that this would force us to consider the matrix 
we discarded above. So the optimizing matrix must break the symmetry! 


16.1.1. Properties. Let us summarize some basic properties of the Colin de 
Verdiére number. We start with developing the combinatorial condition in Lemma 
10.26] further for Colin de Verdiére matrices. 


Lemma 16.7. Let M be a Colin de Verdiére matrix of a graph G, let S CV, and 
let Mg be the submatrix of M formed by the rows and columns in S. If Mg ts 
positive semidefinite, then cork(Msg) < 3. 


Proof. Since Msg is a well-signed positive semidefinite G[S]-matrix, cork(Ms) 
is equal to the number of connected components of G[S] for which the corresponding 
diagonal block of Ms is singular. Let G,...,G, be these components of G[SI, 
and let M,...,M;, be the corresponding diagonal blocks of Ms. By the Perron— 
Frobenius Theorem, Ker(M/;) is 1-dimensional and contains a vector x; > 0. Let 
z > 0 be an eigenvector belonging to the negative eigenvalue of M/. We extend x; 
with 0’s to a vector in RY. Then z'x; > 0, and we may scale 2; so that z'x; = 1. 
The difference x; — x; satisfies z'(a; -—x;) = 0, and hence it belongs to the subspace 
generated by the eigenvectors of M with nonnegative eigenvalues. Since trivially 
(a; -2;)'M(a;—2;) =0, this implies that M(2;—2;) =0. 

Now if r > 4, then 7; —x2 and x3 — 4 are two vectors in Ker(M) with disjoint 
supports that are not connected by any edge, contradicting Lemmaf[l10.26] 
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It is easy to see that if G is a graph with at least one edge and with connected 
components H,,...,H,, then 


(16.1) u(G) = max{u(A1),..., w(He)} 


(Exercise |16.3). It follows, in particular, that if G has at least one edge, and we 
add a new, isolated node, then s(G) does not change. The behavior of 4 under 
the opposite way of adding a new node is also easy to describe but a bit harder to 
prove. 


Lemma 16.8. Let G be a graph with at least one edge, and let us add a new node 
v connected to all of the other nodes to get a graph G’. Then p(G’) = u(G) +1. 


Proof. Let M be an optimal Colin de Verdiére matrix of G, and let x be a 
positive eigenvector belonging to the negative eigenvalue. We may assume that 
Mx = —«x. Consider the matrix 


M -2 
ri 
— (2 =) 
(where v corresponds to the last row). Clearly M’ is a well-signed G’-matrix. It is 
easy to check that it has the Strong Arnold Property: We have to check matrices 
of the form X’ = « 5) , where X is a nonzero G-matrix with zero diagonal. For 
a matrix of this form M’X’ = 0 implies that MX = 0, which implies that X = 0, 
since M has the Strong Arnold Property. It is easy to see that rk(M’) = rk(M), 


which shows that 
u(G’) > cork(M") = cork(M) +1 = p(G) +1. 


To prove that equality holds here is a bit more cumbersome. We may assume 
that ~(G’) > 3, else the assertion is obvious. Let M’ be an optimal Colin de 
Verdiére matrix of G’, then it is of the form 


,_ (Mou 
m= (Sr 4) 


with some vector u < 0 and real number ¢. It is clear that M is a well-signed 
G-matrix. By interlacing eigenvalues, M has at most one negative eigenvalue. It is 
easy to see that cork(M) > cork(M’)—1. Assuming that M’ has the Strong Arnold 
Property, the definition of 4 and the remark in Example imply that 

u(G) > cork(M) > cork(M‘')—1 = p(G")-1. 
So it suffices to prove that M has the Strong Arnold Property. Let X be a nonzero 
G-matrix with zero diagonal, and suppose that MX = 0. Then X’ = “ : is 


a nonzero G’-matrix with zero diagonal. Computing the product 


as MX 0 0 0 
oo (Ar. 5) ~ (wrx 5) 
we see that to get a contradiction it suffices to show that u'X = 0. This will follow 
if we show that u is in the column space of M, since then u = Mv with some vector 
v, and so ulX = v'MX =0. 
Suppose that this is not the case, then adding the column u to M increases its 
rank, and adding the row (u',t) increases it even further, so rk(M’) > rk(M)+2, 
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and so cork(M) > cork(M’)+1. By Interlacing Eigenvalues, this can only happen 
if M is positive semidefinite. So cork(M) is the number of those components of G 
for which the corresponding submatrix of M is singular (i.e., not positive definite). 
By Lemma [16.7] there are at most 3 such components, so cork(M’) < 2, which 
contradicts the assumption that u(G’) > 3. 


Probably the most important property of jz is minor-monotonicity. The proof 
follows the same line of thought as for the algebraic width (see the proof of Theorem 
10.33), and is not reproduced here. 


Theorem 16.9. If graph H is a minor of graph G, then u(H) < p(G). 


Example 16.10. Let Ke , denote the graph obtained from the complete bipartite 
graph K,,, by adding all edges in the color class of size p. We are interested in 
the case q > 3. We can also think of fia as taking q cliques G,,...,Gz with p+1 
nodes each, selecting a p-subset of nodes in each, and glue these together. For p = 1 
the graph is just a star, which (as we have seen) has ul Ky) = 2. For p > 1, the 
graph G is obtained from a star by repeatedly adding a new node and connecting 
it to all previous nodes, so by lemma|16.8| w(K) = pti. 4 


Example 16.11. We can settle the important example of the complete bipartite 
graph K, 4, where p < g. We have seen that (41,1) = 1 and w(K, 4) = pt+q—2 
if 2<q <3. Suppose that q > 4. Since Kp, C Kj, it follows that u(Kp,q) < 
BAS) = p+l. On the other hand, contracting p—4 disjoint edges of K,, we 
get a graph H which can also be obtained from K3 3 by adding p—3 nodes and 


connecting them to all other points. By Lemma pH) = p(K33)+p—3 = 
4+ (p—3) = p+1, and so by minor-monotonicity, (Kp) > p+. 4 


Next, we study what happens to the Colin the Verdiere number if the graph is 
decomposed along a cutset that is a clique. In other words, the graph is obtained 
by gluing together k > 2 graphs Gj,...,G, along cliques of the same size. Can 
u(G) be larger than the maximum of the u(G;)? This can happen, as shown by 
Example[I6.10] But in a sense this is the worst case. 

Let us fix a simple graph G, let S C V induce a clique, and let H,,...H, be the 
connected components of G\S. Let V; = V(H;), and let G; be the subgraph of G 
induced by SUV;. For a G-matrix M, let M; be the restriction of M to the rows 
and columns of H;. Gluing two or more graphs together along a complete subgraph 
of each is often called a clique-sum. So the graph G is the clique sum of the graphs 
Hi (along the specified cliques H;[S]). The following lemma describes the behavior 
of the Colin de Verdiére number under clique-sums. 


Lemma 16.12. With the notation above, 


max{p(G1),.--,u(Gx)} < w(G) < 1+max{u(G1),..., u(Ge)} 


If the upper bound is attained, then |S| = u(G)—2, and we can contract two or 


three of the components and delete the rest to obtain the graph KI ej9° 


Proof. Since every G; is an induced subgraph of G, it follows by Lemma 
that u(G) > max{p(G1),..., u(Ge)}. 

To prove the upper bound, we may assume that N(V;) = S for all i, since 
otherwise we could replace S by N(V;), which leads to a tighter inequality (the 
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discussion of the case of attaining the upper bound takes some work, which is left 
to the reader). The matrix M has the following form: 


Mo Ni... Np 
1 MM 0 
M — * . 7 
N; 0 Mg 


where the first block corresponds to S and the other blocks, to Vi,..., Vx. 

We start similarly as in the proof of Lemma|16.7] Let z and x; be eigenvectors 
belonging to the smallest eigenvalues of MM and M;, respectively, where we extend 
x; with 0’s to a vector in RY. By the Perron—-Frobenius Theorem, we have z > 0 
and a; > 0 (in fact, x;|v, > 0). This implies that z'x; > 0, and we may scale x; so 
that z'x, = 1. 

The difference 2;—2; satisfies 2"(a;—2;) = 0, and hence it belongs to the 
subspace spanned by the eigenvectors of M with nonnegative eigenvalue. This 
implies that 


(16.2) (aj-2;)'M(a,-2;)>0 (1<i,j<k). 


The matrix M,®---®M, is a symmetric submatrix of M, and hence (by In- 
terlacing Eigenvalues) it has at most one negative eigenvalue. So the matrices M; 
are positive semidefinite with at most one exception. 


Case 1. One of the matrices M; (say, M1) has a negative eigenvalue. We claim 
that the other M; are positive definite. Suppose not, and let (say) M2 have a zero 
eigenvalue. Then x2|y, € Ker(M2), and hence supp(Mzz2) C S. Using this, we get 
that 

(a1 —22)'M(a1 —£2) = iMxy <0, 
which contradicts (16.2). 

So My,..., My are positive definite, and, in particular, nonsingular. We can use 
row operations and simultaneous column operations to annulate the submatrices 
N; and Nj, to get the matrix 


MN, 0 .«. 0 

Ni M 0... O 

wa|0 0 Mp 0 
0 0 O My 


with Mj = Mo—>o,3 NiM;'N;’. Consider the matrix 


/ M5 MN 
w= (Nt an) 

By the Inertia Theorem, the eigenvalues of M have the same signs as the eigen- 
values of the matrix M , which are clearly the eigenvalues of M’ together with the 
eigenvalues of Mp,...,M,. Thus M’ has one negative eigenvalue and cork(M) zero 
eigenvalues. 

We claim that all off-diagonal entries of M’ are nonpositive. Indeed, this holds 
for Mp and Mj, and the matrices N; and N;' are nonpositive, so it suffices to argue 
only that the off-diagonal entries of the submatrix Mj = Mo— 0,55 NiM;'NJ are 
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nonpositive; they are in fact negative. The matrix M;* is nonnegative by Lemma 
for i > 2. Hence all entries of N;M;'Nj are nonnegative, and subtracting 
from Mo, the off-diagonal entries remain negative. 

It follows that M’ is a well-signed G,-matrix. We claim that it has the Strong 
Arnold property. Indeed, let X be a nonzero Gj-matrix with zero diagonal, and 
suppose that M’X = 0. Since S induces a clique, we can write 


0 Y 
For any set of V; x V; matrices U; (j = 2,...,k), the matrix 
0 Y oO... O 
YT Z Up ... Up 
wa | UW DT we 0 
0. OF 0 as 0 


is a nonzero G-matrix with zero diagonal. We want to choose the matrices U; so 
that MW = 0; a little playing with these matrices shows that U; = —-Y'N,M;' is 
a right choice. This contradicts the Strong Arnold Property of M. 

Thus we see that M’ has the Strong Arnold Property, and so it is a valid Colin 
de Verdiére matrix for G,. Hence 


u(G) = cork(M) = cork(M’) < (Gi) < max{pu(G1),..., u(Gx)}. 


This completes Case 1. Note that the upper bound in the lemma is never attained 
in this case. 


Case 2. All matrices M; are positive semidefinite. Since contracting V; we get 
a complete (|S|+1)-graph as a minor of Gi, we have u(G1) > |S. So we are done 
unless |S| < u(G); let us assume that this is the case. 

Let D = {x € Ker(M): 2|g = 0}. We have dim(Ker(M/)) = u(G) > |S], which 
implies that D 4 0. Since D C Or, Ker(M;), at least one of the matrices M; must 
be singular. Let (say) M,...,M, be singular, then their least eigenvalues are 0, 
and the corresponding eigenvectors 7; have multiplicity 1. Since z'w, = 1 4 0, we 
have x; ¢ D, and so 

k 
dim(D) < S$ dim(Ker(M;)) = r. 
i=1 

Not all vectors on S are restrictions of vectors in Ker(M); for example, 1g is 
not. Indeed, if 1g = a|g for some x € Ker(M), then 


0=2]Mz=2]Ma2ls+2]Miz|v, =21Milg <0, 


since all entries of z]{N; are negative. This contradiction implies that dim(D) > 
pu(G)—|S| > 1 and so r > 1+dim(D) > 3. 

Since r < 3 follows by Lemma [16.7] we must have equality in the estimates 
above. In particular, |.$| = u(G)—1, which means that G can be contracted to 
K,yq); and so u(G1) = u(G)—1. 

Furthermore, G\S must have at least three components. Contracting three of 
these and deleting the others, we get the graph Kr 


G)-1,3" 
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Corollary 16.13. If u(G) > 2, then u(G) is invariant under subdivision. 


Proof. Minor-monotonicity implies that if we subdivide an edge, then ~(G) 
cannot decrease. Lemma implies that if we glue a triangle onto an edge 
of G, then p(G) does not increase (as long as it is larger than 2). Deleting the 
common edge of the triangle does not increase (again by minor-monotonicity), 
which means that subdividing the edge does not increase p(G). 


We quote without proof the following result [Bacher—Colin de Verdiére 1995}; 
the proof goes along similar lines as the previous one, with more complications. A 
A-—Y transformation of a graph G means creating a new node, selecting a triangle 
T in G, connecting the new node to the nodes of T’,, and deleting the edges of T. 
The reverse operation is called Y—A: this means that we delete a node of degree 
3, and connect each pair of its neighbors (this may create parallel edges). 


Theorem 16.14. If (G) > 3, then u(G) ts invariant under A—Y _ transforma- 
tions. 


16.1.2. Small and large values. Graphs with py < 4 have very interest- 
ing topological characterizations. Statements (a)-(c) of the following theorem 
were proved in [Colin de Verdiére 1990]; an elementary proof of (c) is given in 


[van der Holst 1995]; (d) was proved in [Lovasz—Schrijver 1998}. 
Theorem 16.15. Let G be a connected graph. 
(a) w(G) < 1 af and only if G is a path; 
(b) u(G) < 2 if and only if G is outerplanar; 
(c) u(G) < 3 tf and only if G is planar; 
(d) u(G) < 4 if and only if G is linklessly embeddable in 3-space. 


Graphs with large Colin de Verdiére number, n—4 and up, have been studied 
in [Kotlov-Lovasz—-Vempala 1997]. The results are in a sense analogous to those for 
small numbers, but they are stated for the complementary graph (this is natural, 
since these graphs are very dense), one has to exclude twin nodes (this is a minor 
technical complication), and they are less complete (this is unfortunate). We only 
state two cases. 


Theorem 16.16. Let G be a simple graph without twins. 

(a) IfG is outerplanar, then u(G) > n—4; if G is not outerplanar, then u(G) < 
n—A4; 

(b) If G is planar, then u(G) > n—5; if G is not planar, then u(G) <n—5. 


So for graphs with w(G) = n—5, their complement may or may not be planar; we 
do know that their complement is not outerplanar. 

The proof of these theorems will take up most of this chapter. Our main tools 
will be two vector-labelings discussed in the next two sections. 


16.2. Small values and nullspace representation 


16.2.1. Nullspace representations. Every G-matrix M defines a vector- 
labeling of the graph G in cork(/) dimensions as follows. We take a basis v1,..., Va 
of the null space of M, write them down as row vectors, to get a matrix U. Each 
node 7 of the graph corresponds to a column u; of the matrix obtained, which is a 
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vector in R%, and so we get the nullspace representation of the graph. Saying this 
in a fancier way, each coordinate function is a linear functional on the nullspace, 
and so the nullspace representation is a representation in the dual space of the 
nullspace. 

The nullspace representation satisfies the equation UM = 0; in other words, 
M defines a braced stress on (G,u). The nullspace representation is uniquely 
determined up to a linear transformation of R¢. Furthermore, if D is a diagonal 
matrix with nonzero diagonal, then (D~!u; : i € V) is a nullspace representation 
derived from the scaled version DM D of M (which in many cases can replace /). 
If all the vectors u; are nonzero, then a natural choice for D is Dj; = |u;|: this scales 
the vectors u,; to unit vectors, i.e., we get a mapping of V into the unit sphere. We 
will see that other choices of this scaling may give even nicer representations. 


16.2.2. Colin de Verdiére number and planarity. We start with proving 
assertions (a)—(c) of Theorem}16.15} Part (d) will need more preparation. 


Proof. (a) We have seen (Example [16.4) that every path P, has p(P,) = 1. 
Hence, so does every subgraph of a path with at least one edge (i.e., a graph whose 
connected components are paths). On the other hand, if a graph G is not a subgraph 
of a path, then either it has a node of degree at least 3, or it contains a cycle. This 
implies that either K3 or K3, is a minor of G. Since u(K3) = p(Ki3) = 2 
(Examples [16.2] and [16.11), this implies that u(G) > 2 by minor-monotonicity. 

(b) Suppose that G is outerplanar. To show that u(G) < 2, we may assume 
that G is maximal outerplanar, i.e., a triangulation of a polygon. In this case, G 
can be obtained from a triangle by repeatedly taking clique-sums with a triangle. 
By Lemma|16.12] this implies that p(G) = 2. 

If G is not outerplanar, then it contains either K4 or K32 as a minor. These 
graphs have pu(K4) = p(K3,.2) = 8, and hence p(G) > 3 by minor-monotonicity. 

(c) Suppose that G is planar. Since M is a well-signed G-matrix with ex- 
actly one negative eigenvalue, Theorem|[14.12] implies that cork(/) < 3, and hence 
WG) <3. 

The converse follows by the same type of argument as in the previous cases: 
If G is not planar, then it contains either K; or K3 3 as a minor. Since u(K5) = 
p(K3.3) = 4, this implies that u(G) > 4. 


16.2.3. From Colin de Verdiére matrices to Steinitz representations. 
The theorem proved above implies that every planar graph has Colin de Verdiére 
number 3. Our main tool was the Discrete Nodal Theorem (through its 
corollaries). We show that we get much more out of this. 

Consider a 3-connected planar graph G, a Colin de Verdiere matrix M of G, 
and the nullspace representation u in R® obtained from M. Recall that this satisfies 


(16.3) So Miyjuj=0 (eV). 
J 


This is a vector labeling of G in R%. We have seen another important vec- 
tor labeling of such graphs, namely their Steinitz representation by a con- 
vex polytope. It turns out that these two vector labelings are closely related 
[Lovasz 2001): The vectors u; can be scaled to give a 
Steinitz representation, and conversely, every Steinitz representation yields a Colin 
de Verdiére matrix in a natural way. We discuss these constructions in this and in 
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the next section. (See Exercises [16.4] and [16.5] for scaling results in other, simpler 
cases. ) 

The main step in the construction is the nullspace representation obtained from 
the Colin de Verdiére matrix, as defined in Section [14-2] As a special case of this 
construction, any Colin de Verdiere matrix yields a representation of a graph G in 
y(G)-space. In this case we have M;; < 0 for every edge ij, and so writing the 
definition as 

S- (—Mi;)u,; => Mu; for all 1 = V, 
JEN (i) 
we see that each vector u, is in the cone spanned by its neighbors, or in the negative 
of this cone (depending on the sign of M;;). It follows that if a hyperplane contains 
u;, then either it contains all neighbors 7, or 7 has neighbors on both sides of the 
hyperplane. 

For the rest of this section, let G be a 3-connected planar graph, let M be a 
Colin de Verdiére matrix of G with corank 3, and let (u;: 7 € V) be the nullspace 
representation derived from M. Since G is 3-connected, its embedding in the sphere 
is essentially unique, and so we can talk about the countries of G. 

We say that the Colin de Verdiére matrix M of G is polyhedral if the vectors 
u,; are the vertices of a convex polytope P, and the map 7 +> u; is an isomorphism 
between G and the skeleton of P. Note that the vectors u; are determined by M 
up to a linear transformation of R*, and the skeleton graph of this polytope does 
not depend on this linear transformation. 

Recall that Corollary (12, Tifmplies that for every plane H through the origin, 
the induced subgraphs G[V,;] and G[V,;] are connected. Another property of the 
representation u we need is the following. 


Lemma 16.17. If a,b,c are distinct nodes of the same country, then Ug, Up and 
u. are linearly independent. 


In particular, no node is represented by the 0 vector, and any two adjacent 
nodes are represented by nonparallel vectors. 


Proof. Suppose not. Then there exists a plane H through the origin that 
contains u,,u, and u,. Corollary implies that the sets of nodes Ve and 
V,, induce nonempty connected subgraphs of G. We know that |Vy| > 3; clearly 
either Vj, = Vx or Vj, is a cutset, and hence |Vj,| > 3. Thus G has three pairwise 
node-disjoint paths, connecting a, b and c to distinct points in Vj. We may assume 
that the internal nodes of these paths are not contained in V;,, and so they stay 
in Vy. Contracting every edge on these paths as well as the edges induced by Ve 
and V;,, we obtain a planar graph in which a, b and ¢ are still on one country, and 
they have two common neighbors, which is clearly impossible. 


This lemma implies that each u; is nonzero, and hence we may scale them 
to unit vectors (through the appropriate scaling of the matrix W/). We call such 
a nullspace representation normalized. The previous lemma also implies that u; 
and u,; are nonparallel if i and j are adjacent, and so for adjacent nodes i and j 
there exists a unique shortest geodesic on S? connecting u; and u;. This gives an 
extension of the mapping i > u, to a mapping 7: G — S?. We will show that 
this is in fact an embedding. 

The main result in this section is the following |Lovasz—Schrijver 1998 


Lovasz 2001): 
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Theorem 16.18. Every Colin de Verdiére matriz of a 3-connected planar graph G 
can be scaled to be polyhedral. 


Proof. The proof will consist of two parts: first we show that the nullspace 
representation with unit vectors gives an embedding of the graph in the sphere; 
then we show how to rescale these vectors to get a Steinitz representation. 


Claim 1. Let F, and F» be two countries of G sharing an edge ab. Let H be the 
plane through 0, ug and uy. Then all the other nodes of F, are in V3 and all the 
other nodes of Fy are in Vy (or the other way around). 


Suppose not, then there is a node c of F, and a node d of F2 such that c,d € Ve 
(say). (Note that by Lemma no node of F, or Fy belongs to Vy.) By 
Corollary[14.11| there is a path P connecting c and d with V(P) C V;}. Both a and 
b have neighbors in Ve , and as we have seen, this implies that they have neighbors 
a’ and 0’ in Vj; as well. By Corollary [14.11] again, there is a path connecting a’ 
and b’ contained in V;,. This gives us a path @ connecting a and 6, with at least 
one internal node, contained in V;, (up to its endpoints). 

What is important from this is that P and Q are disjoint. But trying to draw 
this in the planar embedding of G, we see that this is impossible (see Figure[16.1). 


FIGURE 16.1. Why two countries do not fold. 


Claim 1 implies the following. 


Claim 2. [f1,...,k are the nodes of a country, in this cyclic order, then u,,..., Ux 
are the extremal rays of the convex cone they generate in R°, in this cyclic order. 


Next we turn to the cyclic order around nodes. 
Claim 3. Let a € V, and let (say) 1,...,k be the nodes adjacent to a, in this 
cyclic order in the planar embedding. Let H denote a plane through 0 and ug, and 
suppose that uy © VE and uz, € Vy. Then there is anh, 1 <h< k—1 such that 
Uj,.-.,Upn_1 € Ve and Un+41,.-.,Un © Viz. 

If the Claim fails to hold, then there are two indices 1 <i < j < k such that 
u; € Vy and u; € Ve. By Corollary|14.11} there is a path P connecting 1 and 7 in 
V,, and a path P’ connecting i and k in V,;. In particular, P and P’ are disjoint. 
But this is clearly impossible in the planar drawing. 


Claim 4. With the notation of Claim 8, the geodesics on S* connecting ug to 
Uj,...,Ug issue from u, in this cyclic order. 
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Indeed, let us consider the halfplanes T; bounded by the line @ through 0 and 
Ua, and containing u; (i = 1,...,&). Claim 1 implies that 7;,, is obtained from 
T; by rotation about @ in the same direction. Furthermore, it cannot happen that 
these halfspaces 7; rotate about ¢ more than once for i = 1,...,k,1: in such a case 
any plane through @ not containing any u; would violate Claim 3. 


Claim 5. The normalized nullspace mapping is an embedding of G in the sphere. 
Furthermore, each country f of the embedding is the intersection of a convex poly- 
hedral cone C'f with the sphere. 


To prove this, note that Claims 2 and 4 imply that we can extend the map 
w: G— S$? toa mapping ~: S$? > S? such that w is a local homeomorphism 
(ie., every point 2 € S? has a neighborhood that is mapped homeomorphically to 
a neighborhood of w(zx).) This implies (in the case of the 2-sphere) that 7 is a 
homeomorphism (see Exercise [16.6). 

As a second part of the proof of Theorem we construct appropriate 
scaling of a nullspace representation. The construction will start with the polar 
polytope. Let G* = (V*, E*) denote the dual graph of G. 


Claim 6. We can assign a vector wy to each f € V* so that whenever ij € E and 
fg is corresponding edge of G*, then 


(16.4) Wf Wg = Mj; (uj x u;). 


Let vrg = Mi; (ui x u;). It suffices to show that the vectors vy, sum to 0 over 
the edges of any cycle in G*; then we can invoke the potential argument. Since G* 
is a planar map, it suffices to verify this for the cycles bounding the countries of 
G*. Expressing this in terms of the edges of G, it suffices to show that 


S- Mi; (ui x u;) = 0. 
JEN (i) 
But this follows from the basic equation 5° j M,;u; = 0 upon multiplying by uj, 
taking into account that u,; x u; = 0 and M;,; = 0 for 7 ¢ N(z)U{t}. This proves 
Claim 6. 

An immediate property of the vectors wy is that if f and g are two adjacent 
nodes of G* such that the corresponding countries of G share the node i € V, then 
uw f= u} wy. This follows immediately from (16.4), upon scalar multiplication 
by u;. Hence the inner product u]w y is the same number a; for every country f 
incident with u;. In other words, these vectors wy all lie on the plane ul x = a4. 

Let P* be the convex hull of the vectors wy. 


Claim 7. Let f € V*, and let u€ R®, u#0. The vector wy maximizes u'x over 


P* if and only if ue Cr. 


First suppose that wy maximizes u'z over P*. Let i1,...,i. be the nodes of 


the country of G corresponding to f (in this counterclockwise order). By Claim 2, 
the vectors u;, are precisely the extreme rays of the cone Cy. 
Let fg: be the edge of G* corresponding to i:i441. Then u'(wr—Wg,) > 0 for 


t=1,...,8, and hence by (16.4), 


+ 
u Mision (Ui,., x ui, ) > 0, 
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or 
ul! (uj, x uj,,,) = 0. 


This means that u is on the same side of the plane through u;, and uj,,, as Cy. 
Since this holds for every 7, it follows that u € Cy. 

Conversely, let u € Cy. Arbitrarily close to u we may find a vector u’ in the 
interior of Cy. Let w, be a vertex of P* maximizing (u’)'x over  € P*. Then by 
the first part of our Claim, u € Cy. Since the cones C;,, h € V* are nonoverlapping, 
it follows that f = g. Thus (u’)'z is optimized over x € P* by wy. Since u’ may 
be arbitrarily close to u, it follows that u'z too is optimized by wy. This proves 
Claim 7. 

This claim immediately implies that every vector wy (f € V*) is a vertex of 
P*, 


Claim 8. The vertices wy and w, form an edge of P* if and only if fg € E”. 


Let fg € E*, and let 27 be the edge of G corresponding to fg € E*. Let u 
be a vector in the relative interior of the cone C; Cj. Then by Claim 7, u'z is 
maximized by both wy and wy. No other vertex w,, of P* maximizes u'z, since 
then we would have u € C;, by Claim 7, but this would contradict the assumption 
that u is in the relative interior of Cs C,. Thus wy and wy span an edge of P*. 

Conversely, assume that wy and w, span an edge of P*. Then there is a nonzero 
vector u such that u'x is maximized by w y and wg, but by no other vertex of P*. 
By Claim 7, u belongs to CrNC,, but to no other cone Cz, and thus the cones 
Cy and C, must share a facet, proving that fg € E*. This completes the proof of 
Claim 8. As an immediate corollary, we get the following. 


Claim 9. Every vector wy is a vertex of P*, and the map f ++ wy is an isomor- 
phism between G* and the skeleton of P*. 


To complete the proof, let us note that the vectors wy are not uniquely de- 
termined by (16.4): we can add the same (but otherwise arbitrary) vector to each 
of them. Thus we may assume that the origin is in the interior of P*. Then the 
polar P of P* is well-defined, and its skeleton is isomorphic to G. Furthermore, 
the vertices of P are orthogonal to the corresponding facets of P*, i.e., they are 
positive multiples of the vectors u;. Thus the vertex of P representing node 7 is 
ui, = a;,u;, where a; = u]w y > 0 for any country f incident with 7. 


The scaling of M constructed in this proof is almost canonical. There were 
only two free steps: the choice of the basis in the nullspace of M, and the trans- 
lation of P* so that the origin be in its interior. The first corresponds to a linear 
transformation of P. It is not difficult to show that if this linear transformation is 
A, then the scaling constants are multiplied by | det(A)]. 

The second transformation is a bit more complicated. Translation of P* cor- 
responds to a projective transformation of P. Suppose that instead of P*, we take 
the polyhedron P* +w, then the new scaling constant becomes a4 = u} (wp-+w) = 
a;+u}w. Note that here (u}w : i € V) is just an arbitrary vector in the nullspace 
of M. Thus we get that in the above construction, the scaling constants a; are 
uniquely determined by M up to multiplication by the same positive number and 
addition of a vector in the nullspace of M. 
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If you want a canonical choice, you can make it orthogonal to Ker(M). This 
can be expressed explicitly by solving for w the equation 


SS(aitusw)u; =0, 
7 
which gives 
w =—(UU')"lUa. 
Of course, there might be many other scalings that yield polyhedra. For ex- 
ample, if P is simple (every facet is a triangle), then any scaling “close” to the one 
constructed above would also yield a Steinitz representation. 


16.2.4. From Steinitz representations to Colin de Verdiére matrices. 
Let P be any polytope in R?, containing the origin in its interior. Let G be the 
skeleton of P. Let P* be its polar, and G* = (V*, E*), the skeleton of P*. 
Let u; and u; be two adjacent vertices of P, and let wy and w, be the endpoints 
of the corresponding edge of P*. Then by the definition of polar, we have 
u; wy UW, uj wy uj Wy ile 
This implies that the vector wf —w, is orthogonal to both vectors u; and u;, and 
hence it is parallel to the vector u; x u;. Thus we can write 


Wf — Wg = Mij(ui x uy), 
where the labeling of wy and w, is chosen so that Mj; < 0; this means that u,, u; 


and wy, form a right-handed basis. 
Let 2 € V, and consider the vector 
u; = »: Mj ju;. 
JEN (i) 
Then 
uj Xx u; = > Mj; (ui x u;) = So (wy — Wy) = 0, 
JEN (#) 
because the last sum extends over all edges fg of the facet of P* corresponding to 
é, oriented counterclockwise, and hence it vanishes. Thus u; and uj are parallel, 
and we can write 


with some real M;;. We complete the definition of a matrix M = M(P) by setting 
M;; = 0 whenever 7 and j are distinct nonadjacent nodes of G. 


Theorem 16.19. The matriz M(P) is a Colin de Verdiére matrix for the graph 
G. 


Proof. It is obvious by construction that M has the right pattern of 0’s and 
negative elements. (16.5) can be written as 


LS Mju; = 0, 
J 


which means that each coordinate of the u; defines a vector in the null space of M. 
Thus M has corank at least 3. 

For an appropriately large constant C > 0, the matrix CI —M is nonnegative 
and irreducible. Hence it follows from the Perron—Frobenius Theorem that the 
smallest eigenvalue of M has multiplicity 1. In particular, it cannot be 0 (which we 
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know has multiplicity at least 3), and so it is negative. Thus M has at least one 
negative eigenvalue. 

The most difficult part of the proof is to show that M has only one negative 
eigenvalue. Observe that if we start with any true Colin de Verdiére matrix M of 
the graph G, and we construct the polytope P from its null space representation as 
in Section [16.2.3] and then we construct a matrix M(P) from P, then we get back 
(up to positive scaling) the matrix M. Thus there is at least one polytope P with 
the given skeleton for which M(P) has exactly one negative eigenvalue. 

Steinitz proved that any two 3-dimensional polytopes with isomorphic skeletons 
can be transformed into each other continuously through polytopes with the same 
skeleton (see ). Each vertex moves continuously, and hence 
so does their center of gravity. So we can translate each intermediate polytope so 
that the center of gravity of the vertices stays 0. Then the polar of each intermedi- 
ate polytope is well-defined, and also transforms continuously. Furthermore, each 
vertex and each edge remains at a positive distance from the origin, and therefore 
the entries of M(P) change continuously. It follows that the eigenvalues of M(P) 
change continuously. 

Assume that M(Po) has more than 1 negative eigenvalue. Let us transform P5 
into a polytope P, for which M(P,) has exactly one negative eigenvalue. There is 
a last polytope P; with at least 5 nonpositive eigenvalues. By construction, each 
M(P) has at least 3 eigenvalues that are 0, and we know that it has at least one 
that is negative. Hence it follows that M(P,) has exactly four 0 eigenvalues and 
one negative. But this contradicts Proposition [14.12] 

Next we show transversality. Let X be a nonzero G-matrix with zero diagonal, 
and assume that MX =0. Every column of X is in the nullspace of M. We know 
that this nullspace is 3-dimensional, and hence it is spanned by the coordinate 
vectors of the u;. This means that there are vectors h; € R® (i € V) such that 
Xi; = ujh;. We know that X;; = 0 if 7 = i or j € N(i), and so h; must be 
orthogonal to u, and all its neighbors. Since u; and its neighbors obviously span 
R°, it follows that X = 0, a contradiction. 

Finally, it is trivial by definition that M is polyhedral. 


16.2.5. Further geometric properties. Let M be a polyhedral Colin de 
Verdiere matrix of G, and let P be the corresponding convex 3-polytope. Let P* 
be the polar polytope with 1-skeleton G* = (V*, E*). The key relations between 
these objects are: 


(16.6) y Miju; = 0 for every 1 € V, 
J 


and 
(16.7) wy—w, = M;;(u; x u,) for every ij € E and corresponding fg € E*. 


There are some consequences of these relations that are worth mentioning. 
First, taking the vectorial product of (16.7) with w,, we get (ws — wy) x Wy = 
Wy X Wy on the left hand side and 


on the right. Thus 
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The vector h; = (1/|u;|?)u; is orthogonal to the facet i of P* and points to a 
point on the plane of this facet. Taking the scalar product of both sides of 
with h;, we get 

hj (wy x Wy) = Mi; (hju; = iL), 
The left hand side is 6 times the volume of the tetrahedron spanned by hy, wy and 
w,. Summing this over all neighbors 7 of i (equivalently, over all edges fg of the 
facet i of P*), we get 6 times the volume V; of the cone spanned by the facet 7 of 
P* and the origin. On the right hand side, notice that the expression is trivially 0 
if 7 is not a neighbor of 7 (including the case j = 7), and hence we get by 


a Mi; (hj uy -lj= h/ S- Mju; -So My =—- S- Mi;- 
j j 


J J 


Thus we get 
(16.9) S> Mi; = —6V;. 
J 


Note that by (16.6), 
1 
Svm=-§ Li {My | w=-gF LD Mow =o. 
i a j qi a 


This is a well known fact, since 3V;u; is just the area vector of the facet 7 of P*. 
Equation gives the following relation between the Colin de Verdiére ma- 
trices of a 3-connected planar graph and its dual. Let M be a polyhedral Colin de 
Verdiére matrix for a planar graph G. Let P be a polytope defined by M. Let Vy 
denote the volume of the cone spanned by the origin and the facet f. If27 € EF inci- 
dent with facets f and g, then let MF, = 1/Mij. Let Mj, = —V;— Dl oentp) Mj, 
finally, let M;, = 0 if f and g are distinct nonadjacent facets. These numbers form 
a symmetric matrix M* € RY *Y", 
Corollary 16.20. The matrix M™* is a polyhedral Colin de Verdiére matrix of the 
dual graph G*. 


16.3. Linkless embeddings 


In this section we return to part (d) of Theorem|16.15} The condition is more 
involved than the others, and needs an explanation, which we provide before turning 
to the proof. 


16.3.1. Preliminaries: topology. Every (finite) graph can be embedded in 
3-space, but there are ways to distinguish “simpler” embeddings. First, we have 
to define when two disjoint Jordan curves J, and Jz in R® are “linked”. There are 
several nonequivalent ways to define this. Perhaps the most direct (and let this 
serve as our definition) is that two curves are linked if there is no topological 2- 
sphere in R° separating them (so that one curve is in the interior of the sphere, while 
the other is in its exterior). This is equivalent to saying that we can continuously 
shrink either one of the curves to a single point without ever intersecting the other. 

An alternative definition uses the notion of linking number. We consider an 
embedding of the 2-disc D in R*? with boundary J,, which intersects Jp a finite 
number of times, and always transversally. Walking along Jz, we count the number 
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of times D is intersected from one side to the other, and subtract the number of 
times it is intersected in the other direction. If this linking number is 0, we say 
that the two curves are homologically unlinked. Reasonably elementary topological 
arguments show that the details of this definition are correct: such a 2-disc D exists, 
the linking number as defined is independent of the choice of D, and whether or not 
two curves are homologically linked does not depend on which of them we choose 
as J}. 

In a third version, we count all intersection points of Jz with the disk, and 
we only care about the parity of this number. If this parity is even, we say that 
the two curves are unlinked modulo 2. If two curves are unlinked, then they are 
homologically unlinked, which in turn implies that they are unlinked modulo 2. 
(Neither one of these implications can be reversed.) 

An embedding of a graph G in R? is called linkless if any two disjoint cycles in 
G are unlinked. A graph G is linklessly embeddable if it has a linkless embedding 


in R3 (Figure [16.2). 


FIGURE 16.2. Left: A graph embedded in 3-space with two cycles 
linked. Right: An apex graph (a planar graph with an additional 
node). This embedding is linkless (even flat). 


There is an even stronger definition of linkless embeddings: An embedding of 
G in R° is called flat, if for each cycle C in G there is a disk D bounded by C (a 
‘panel’) whose interior is disjoint from G. 

Clearly, each flat embedding is linkless in any of the senses above, but the 
converse does not hold in general. (A trivial example: embed a single cycle in 
3-space so that the image is a proper knot.) However, if G has an embedding 
that is linkless in the weakest sense (the linking number of any two disjoint cy- 
cles is even), then it has a flat embedding (linkless in the strongest sense) as well. 
So the class of linklessly embeddable graphs is very robust with respect to varia- 


tions in the definition. This fact follows from |Robertson—Seymour—Thomas 1993 
Robertson—Seymour—Thomas 1993}, as a byproduct of Theorem |16.21] below. 


Suppose that we have a graph G linklessly embedded in 3-space (in any par- 
ticular sense). It is trivial that if we delete an edge from G, then the embedding 
remains linkless. It takes a bit more effort to verify that if we contract an edge e, 
then the resulting graph G/e is linklessly embeddable: informally, we can contin- 
uously deform the embedding to make the edge e shorter and shorter, until in the 
limit we get an embedding of G/e. Two linked cycles in G/e would yield two linked 
cycles in G. So linkless embeddability is a minor-closed property. 

For either one of the versions of linkless embedding, if a graph is linklessly 
embedded, and we perform a A—Y or Y —A transformation, then there is a way 


306 16. THE COLIN DE VERDIERE NUMBER 


to define an embedding of the new edges so that the resulting graph is linkless as 
well. This is quite easy to see for Y—A, and a bit more difficult for A—Y (see 
Exercise [16.8). 

A basic result in the graph minor theory of Robertson, Seymour and Thomas 
says that every minor-closed graph property can be characterized by a finite 
number of forbidden minors. To find this list explicitly is, in general, very dif- 
ficult, and it is very remarkable that, verifying a conjecture of Sachs, a deep 
forbidden minor characterization of linklessly embeddable graphs was proved in 


Robertson—Seymour—Thomas 1993}. 


Theorem 16.21. A graph G is flatly embeddable in 3-space if and only of it does 
not contain any of the graphs in Figure[16.5, as @ minor. 


Be 
iy yy 


FIGURE 16.3. Excluded minors for linklessly embeddable graphs 
(the Petersen Family). 


We note that the graphs in the Petersen family (Figure [16.3) cannot be link- 
lessly embedded even in the weakest sense: every embedding of them contains two 
disjoint cycles with an odd linking number. Exercise[16.9]gives a hint how to prove 
this in the case of Kg; the other graphs in the Petersen family arise from Kg by 
A-—Y and Y —A transformations, which preserve this property. 

The rest of this section is devoted to the proof of part (d) of Theorem 
giving a different characterization of linklessly embeddable graphs. 


16.3.2. Preliminaries: simultaneous paneling. We do not claim that ina 
flat embedding, the panels belonging to different cycles are disjoint from each other. 
For example, let us take the graph K22.2, embedded in 3-space as the skeleton of the 
regular octahedron. This embedding is flat (the skeleton of any convex 3-polytope 
is), but one cannot panel the facets and the three 4-cycles without creating some 
intersection between the interiors of the panels. If each member of a family of 
cycles can be paneled so that the interiors of the panels are disjoint, we say that 
this family can be simultaneously paneled. Reasonably large families of cycles can 


be paneled simultaneously, as the following result of |Bohme 1990} shows: 


Lemma 16.22 (Paneling Lemma). Let C be a family of cycles in a flatly embed- 
ded graph G. Assume that the intersection of any two cycles in C is either a path 
or empty. Then the cycles in C can be simultaneously paneled. 


We need the following special case of the paneling lemma. Let T be a spanning 
tree of a flatly embedded graph G. For each edge e € E'\ F(T), let C. be its 
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fundamental cycle (the unique cycle in T +e). The cycles C, satisfy the conditions 
of the Paneling Lemma, and so they can be simultaneously paneled. We call this 
paneling a fundamental paneling with respect to T. 

Let Y be the union of the panels in a fundamental paneling. It is easy to see 
that Y, as a topological space, is contractible. This implies, in particular, that 
every cycle of G can be contracted to a single point inside Y, and also that the 
complement of Y is connected. 

As an application of the Paneling Lemma, we prove a fact about extending 
linkless embeddings. 


Lemma 16.23. Let G be a flatly embedded graph, and let S C V induce a connected 
subgraph. Define G, = G[SUN(S)] and Gz = G\S, and let i,j € V(G2). Then the 
embedding can be extended to an embedding of G+17 in which no cycle in Go+ij 
is linked with any cycle in Gy. 


Proof. Consider a spanning tree of G[S], and extend it to a spanning tree T of 
G, so that all nodes in N(S) are leaves of T. Consider a fundamental paneling of 
G, with respect to T (as described after the Paneling Lemma|16.22), and let Y be 
the union of these panels. As remarked, the complement of Y is arcwise connected, 
and hence we can connect 7 and j by a Jordan arc J whose interior is disjoint from 
Y. We may perturb J a little to make its interior disjoint from the rest of G as 
well. 

We claim that using J to embed the edge ij, we get an embedding of G+7j 
that satisfies the conditions of the lemma. Indeed, let C; be a cycle in G; and 
C2, a cycle in Gg+7j disjoint from C,. We want to prove that C; and C2 are not 
linked. We may assume that i7 € E(C2) (else, the assertion is trivial since G is 
flatly embedded). 

If an edge e € E(G;) is disjoint from C2, then the whole cycle C, is disjoint 
from C2, since all the nodes of C,. other than the endpoints of e are contained in 
S. Hence the panel of the cycle C. is disjoint from C2 as well. Since C; can be 
contracted inside the union of these panels, it cannot be linked with C2. 


16.3.3. Preliminaries: polytopes. We need a couple of results about poly- 
topes. First, a technical lemma. Let us say that a segment xy crosses a hyperplane 
H, if x and y belong to different sides of H. In other words, the segment intersects 
A ina single point that is an interior point of it. 


Lemma 16.24. Let F be a finite family of linear hyperplanes in R¢. Then there 
is a centrally symmetric simplicial d-polytope P such that no edge of P crosses any 
hyperplane in F. 


Proof. If P is a convex polytope containing the origin in its interior, and x 
is a point on its boundary, then pulling x is the operation of replacing P with the 
convex hull P’ of P and (1+¢)x. Here we assume that ¢ > 0 is small enough so 
that for every facet not containing x, the point x’ = (1+¢)x is on the same side 
of the facet as P. If P is simplicial, then pulling preserves this property. The new 
point x’ is a vertex; every other vertex remains a vertex of P’, except if x was a 
vertex of P. Every edge of P remains an edge, except if x was an interior point of 
an edge e of P; in this case, e ceases to be an edge. New edges connect x’ to all 
vertices on faces containing x (Figure [16.4). 

If P is centrally symmetric (with respect to the origin), then we can pull both 
x and —x by the same «e to keep central symmetry. 
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FIGURE 16.4. Left: an octahedron and a plane through its center 
intersecting two edges. Right: Pulling the intersection points of 
the edges and the plane to create new vertices, but eliminate the 
intersections. The illustration is in 3 dimensions, in the proof we 
need 5. 


Starting with any simplicial centrally symmetric polytope in R@ (say, with the 
cross-polytope), we pick a hyperplane H € fF, and pull all intersection points of H 
with the interiors of edges, maintaining central symmetry. Let us call this “cleaning 
H”. We clean all hyperplanes in F one-by-one, to get a polytope P’. 

We claim that P’ satisfies the requirements of the lemma: cleaning a hyperplane 
H gets rid of all crossings between edges and H without creating new crossings with 
hyperplanes already cleaned. Indeed, when pulling an interior point x of an edge 
e, all the new edges are incident with x’, and so they do not cross H. Let H’ € F 
be a hyperplane already cleaned. If H’ contains x, then it contains x’, and so no 
new edge crosses it. If H’ does not contain x, then every face containing x is fully 
contained in the closed halfspace bounded by H’ containing x (since H’ is already 
cleaned). Since x’ is contained in this closed halfspace as well, no new edge will 
cross H’. 


Next, we relate polytopes and the topological considerations in the previous 
section. We have seen that the skeleton of any 3-polytope, as a graph embedded in 3- 
space, is linkless (even flat). Among skeletons of 4-polytopes we find both linklessly 
embeddable ones (say, Ks5), and nonembeddable ones (every complete n-graph, 
n > 5, is the skeleton of a 4-polytope). The following results [Lovasz—Schrijver 1998} 
imply that the skeleton of a 5-polytope is never linklessly embeddable. 

To state the result, we need a definition from convex geometry. Let P be a 
d-polytope. We say that two faces F, and F, of P are antipodal, if there are two 
parallel supporting hyperplanes H; and H2 such that F, C Hy and F2 C He (Figure 
(16.5). It is easy to see that every d-polytope has antipodal faces F, and F> with 
dim(F)+dim(F)) = d—1: for example, we can take a facet and a vertex farthest 
from its hyperplane. 

We call two faces A; and Ag parallel if their affine hulls are disjoint but these 
affine hulls contain parallel lines. Formally, aff(A1)Maff(A2) = @ but lin(A; — A1)/N 
lin(Az— Az) # 0. Two antipodal faces with dim(F,)+dim(F2) > d are always 
parallel. 

It will be convenient to work with polytopes with no parallel faces. This can 
be achieved by taking a hyperplane H that it disjoint from P, does not contain any 
nonempty intersection aff(A,)Maff(A2), and is not parallel to any nonnull linear 
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FIGURE 16.5. Antipodal faces of a polytope 


subspace lin(A; — A;)Mlin(Ag—A2). Moving H to infinity by a projective trans- 
formation maps P onto a polytope with the same lattice of faces but with no parallel 
faces. 

One advantage of having no parallel faces is that for two antipodal faces F, and 
Fy, there are two parallel supporting hyperplanes H; and H2 such that F; = H; 1 P 
(Exercise [16.11). A further simple property we need is the following. 


Lemma 16.25. Let P be a d-polytope having no parallel faces, and let A, and Ag 
be antipodal facets with dim(A,)+dim(Az) = d—2. Then the number of pairs of 
antipodal facets {F, F)} with Ay C Fi, Ao C Fo and dim(F,)+dim(F2) = d—-1 is 
2. 


Proof. Let H, and Hz be two parallel supporting hyperplanes such that A, = 
AAP and Ap = H2NP. Let L; = lin(A;—A;), and let H be the hyperplane 
obtained by translating H, to the origin. Then L191 L2 = 0 (since A; and A» are 
not parallel) and L;+L2 C H. Since dim(L1+L2) = d—2 = dim(H)—1, we can 
rotate H about [+ L2 in two directions. This corresponds to simultaneous rotation 
of H; and Hz keeping them parallel and preserving A; C H;. In both directions, 
we can rotate until one of the H; hits a face F; of P with dim(F;) > dim(A;). 

Now the hypothesis that P has no parallel faces has some useful consequences. 
It cannot happen that both H; and Hp hit higher-dimensional faces F, and Fy» 
simultaneously, since these would be antipodal faces with dim(F))+dim(F2) > d, 
and therefore parallel. By a similar argument, we must have dim(F;) = dim(A;) +1, 
and so {F;,A3_;} is an antipodal pair with dim(F;)+dim(A3_;) = d—1. So 
rotation in either direction yields such a pair, two of them altogether. It is easy to 
see that no further such pair exists. 


A continuous map ¢: OP — R™ will be called generic, if the images of the 
interiors of a k-face and an l-face do not intersect if k+1 < m, these images have a 
finite number of intersection points if k+l =m, and at these points they intersect 
transversally. 

Let F’ denote the relative interior of the face F. For a vertex point x, we define 
X=x. 

Lemma 16.26. Let P C R®@ be a conver polytope having no parallel faces and 
let @: OP — Rt be a generic continuous map. Then S~|o(Fi)N¢(F2)| is 


odd, where the summation extends over all antipodal pairs {F\, F2} of faces with 
dim(F,)+dim(F2) =n-1. 


In particular, there exists a pair of antipodal faces F, and F) with dim(F))+ 
dim(F,) = d—1 such that |¢(F,)N¢(F2)| is odd. This extends a result of 
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Bajmoczy—Barany 1979], which asserts the existence of such a pair of faces with 
b:(Fi) Gi (F2) FO. 


Proof. There is at least one map for which this is true: the map @¢p9 defined 
by projecting OP onto a facet F' of P from a point outside P but very close to an 
interior point of F' (this map was used to construct the Schlegel diagram). By the 
assumption that P has no parallel faces it follows that there is a unique vertex v 
antipodal to F. Then >> |¢o(F1) N¢o(F2)| = 1, since the only nonzero term in the 
sum is obtained when {F, Fp} = {F,v}. 

Now we deform ¢p through a family of maps ¢;: OP > R&! (0 <t <1) 
so that ¢1 is an arbitrary generic continuous map. We need that each ¢; has the 
following properties. Let A; be a k;-dimensional face of P (4 = 1,2). 


(i) If ky tke < d—2, then ¢;(Aj1) and ¢;(Az) do not intersect. 


(ii) If ky +ko = d—2, then ¢;(A,) and ¢;(Ag) intersect for a finite number of 
values of t only, and then |¢,;(A,)N¢@;(Ag)| = 1, and this happens only for one pair 
of faces at the same time. 

(iii) If ky +ko = d—1, then ¢;(A,) and ¢;(Az2) have a finite number of inter- 
section points, and at these points they intersect transversally. 

To see that such a deformation exists takes some (routine) arguments in topol- 
ogy, which are omitted. 

The number |¢;(F1)N¢;:(F2)|, where dim(F,)+dim(F)) = d—1, does not 
change unless o:(F1) intersects (Ae), where A» is a facet of Fb, or the other 
way around. So t must be one of the special values from (ii), and in fact the inter- 
secting pair A; and A» must be antipodal. Fixing this A; and Ag, the target sum 
>> |¢e(F1) Nd: (F2)| changes by 1 for every pair {F,, F)} of antipodal faces with 
dim(F,)+dim(F)) = d—1 such that A; C Fy and Ag C Fy. By Lemma [16.25] 
there are two such pairs, so the sum changes by an even number. 


Now we are able to state and prove our main topological tool, establishing the 
connection between linear algebra (convex polytopes) and linking cycles. The fact 
may be interesting in itself as a Borsuk-type theorem about linking. The theorem 
extends to polytopes in any odd dimension with essentially the same proof (see 


Lovasz—Schrijver 1998] for the details). 


Theorem 16.27. Let P be a convex polytope in R®° and let ¢ be an embedding of 
its 1-skeleton into R°. Then there exist antipodal 2-faces F, and Fy, of P such that 
@(OF,) and d(OF2) have an odd linking number. 


Proof. Let us extend ¢ to a continuous mapping ¢’ : OP — R* (this is clearly 


possible) and define 
o' (x) 
wo= (4) 
where f(x) = 0 if x € P, and f(x) > 0 otherwise (for example, f(x) can be the 
distance of x from P,). We can perturb f a little to make w generic. 

By Lemmaj16.26} P has a pair of antipodal faces F and F) such that dim(F,) + 
dim(F2) = 4 and |¥(F,)Ny(F2)| is odd. Note that the intersection points of ~(F) 
and w(F2) cannot be between 7(P,) and ~(OP \ P,;), since the former is contained 
in the hyperplane x4 = 0, while the latter is contained in the halfspace x4 > 0. So 
we must have dim(F) = dim(F) = 2. 
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We claim that w(OF)) and w(OF2) have an odd linking number. Let #1: Fi > 
R? be an extension of wlar,, such that w(F,) intersects G transversally a finite 
number of times. Let w2: F2 > R* be an extension of wlar,, such that the fourth 
coordinate of every point of 72(Fo is negative. Then Ylap,Uy, and lap, Uv2 
can be thought of as two maps of the 2-sphere into R+. The common points of 
the images are the intersection points of W(F,) and w(F), and the intersection 
points of 7 (F,) and ~(OF)). The number of these intersections must be even. 
Since |¢)(F,)Nw(F2)| is odd, so is |¢1(F,)Nv(OF2)|, which proves that ¢(OF,) and 
(OF) have an odd linking number. 


16.3.4. Completing the proof: Nullspace representation. We want to 
prove that a graph G is linklessly embeddable if and only if u(G) < 4. The fact 
that this condition is sufficient was noted by Robertson, Seymour and Thomas. 
Indeed, if a graph is not linklessly embeddable, then by Theorem [16.21] it contains 
one of the graphs in the Petersen family (Figure[16.3) as a minor. All these graphs 
have p(G') = 5. This can be verified directly, or it can be justified by the following 
argument: u(Ke) = 5 by Example and hence Theorem implies that 
u(G) = 5 for every graph that arises from Kg by A—Y and Y —A transformations. 
It is not too tedious to check that these are exactly the graphs in the Petersen 
family. 

So a graph G that is not linklessly embeddable has a minor with uw = 5; by 
minor monotonicity, this implies that u(G) > 5. 

To prove the converse, let G be a linklessly embeddable graph, and fix a flat 
embedding of G in R°. Suppose (by way of contradiction) that u(G) > 5. Let 
u: V — R?° be the nullspace representation obtained from a Colin de Verdiére 
matrix M of G. By Lemma [16.24] there exists a centrally symmetric polytope 
P C R? that no edge of P crosses any of the hyperplanes uj, (u;+u,)+, or 
(u;—u,)+. In other words, for every proper face F of P, any of these hyperplanes 
either contains F or is disjoint from its interior. 

We are going to work with three geometrically represented graphs. First, there 
is the linkless embedding of G in R*; second, there is the nullspace representation of 
M in R® (it may be useful to also keep in mind the arrangement of the hyperplanes 
uz); and third, we will consider the polytope P and its 1-skeleton P,, also in R°. 
Our goal is to map P; into R? continuously. Roughly speaking, we want to map P; 
into the graph G (as embedded in R?). T heorem|16.27] will provide a pair of linked 
cycles in G. 

For every vector x € R®, let Vo(x) = {ie V: x'u; = O}, Vi(x) = {1 € 
V: x'u; > 0} and V_(x)={ieV: x'u; < 0}. For every vertex v of P, let d(v) 
be any node in V,(v) (which is nonempty by the Discrete Nodal Theorem [14.10). 

To extend the map ¢ to the edges of P, we need a simple observation. 


Claim 1. Let F be a face of P, let z be an interior point of F, and let i © V_ be 
any node of G. If zu; = 0, then v'u; = 0 for every verter v € V(F); if z"u; > 0, 
then v'u; > 0 for every verter v € V(F), and vlu; > 0 for at least one vertex 
veV(F). 


This follows from the fact that the hyperplane u+ does not separate any two 
vertices of F’. 

Since v Ug) > 0 by the definition of ¢(v), it follows that Z"Ug(v) > 0 for 
every vertex v € V(F’). In other words, ¢(v) € V,(z). 
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Now we describe how to extend the map ¢ to the edges of P. Let vw be an 
edge of P, let i = ¢(v) and j = ¢(w), and let z be an internal point of the edge vw. 
Then 7,7 € Vi(z) by Claim 1. In the (easy) case when 7 and j belong to the same 
connected component of G[V+(z)], we can find a path of G in V;(z) connecting i 
and j, and map the edge vw onto this path Figure [16.6). 


Pc RS StU GcR°® 


FIGURE 16.6. Illustration of constructing the map ¢. Top middle: 
nullspace representation u in R°. Left: the polytope P C R® with 
a hyperplane u+. Right: G embedded in R°. 


The more difficult case is when i and 7 belong to different components of 
G[V(z)]. Such an edge of P is called broken. This in particular implies that 
G[V,(z)] is disconnected, so the exceptional case (ii) applies in the Discrete Nodal 
Theorem |14.10] (it is useful to look at the geometric description at the end of that 
section). Thus G[Vo(z)] has three connected components H,, Hz and H3, where z 
is positive on H; and Ho, and negative on H3. We may assume that 7 € V(A1) and 
j € V(H2). By Lemma|16.23| we can extend the embedding of G to an embedding 
of G+ij so that no cycle in G[V(H3)UN(V(H3))] is linked with any cycle in 
(G+i7)\V(H3). We map the edge vw of P onto the edge 77 of G+ij. 

We carry out this construction for every broken edge of P,; we can make sure 
that the new edges added are disjoint (except for their endpoints). Thus we have 
extended G to a graph G embedded in R? (this embedding will not be linkless in 
general). This gives a continuous mapping ¢: P; — R%. We can perturb this 
mapping a very little to get a continuous injective mapping @. 


Claim 2. Let vw be a broken edge of P. Then the 3-dimensional subspace {v, w}+ 
is generated by the vectors u; contained in it. 


Indeed, by the Nodal Lemma, there is a 3-dimensional subspace S$ C z+ (where 
z is an internal point of the edge vw), and this is generated by the vectors u; 
contained in it. Since every hyperplane u; containing z also contains v by Claim 
1, it follows that v € S+. Similarly w € $+, and hence S = {v,w}+. 


Claim 3. For every face F of P, the faces F and —F together have at most one 
broken edge. 


Suppose that vw and v’w’ are different broken edges of a face F’. By Claim 2, 
{v,w} is generated by three vectors u;,, u;, and u,,, and similarly, {v’, w’}+ is 
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generated by three vectors uj,, uj, and uj,. Since {v,w}t 4 {v’,w’}+, we may 
assume that v'u,;, 40 and vu, ~ 0. But then the edge vv’ of F crosses one of 
the hyperplanes (u;,+u,,)> or (u;, —u;,)+, a contradiction. 

A similar argument applies when v’w’ is an edge of —F different from the an- 
tipode of vw. The antipode of a broken edge is not broken, since ¢(—vb), d(—wb) € 
V,(—z) = V_(z), and Viz) induces a connected subgraph by (ii) of the Discrete 
Nodal Theorem [14.10] again. This proves the claim. 

The map ¢: P, — R? is not necessarily injective, but we can modify it by 
arbitrarily little to get an injective continuous map w: P, — R°. By Theorem 
P has two antipodal 2-dimensional faces Ff and —F such that the cycles 
w(C) and w(—C) have an odd linking number, where C = 0(F). 


Claim 4. The subgraphs ¢(C) and ¢(—C) of G are node-disjoint. 


Indeed, let z be any interior point of F’, then Z"Ug(v) > 0 for every vertex 
v € V(P) by Claim 1, ie., d(v) € V;(z). It follows by a similar argument that if f 
is an unbroken edge of F, then V(#(f)) € V(z). Since C has at most one unbroken 
edge by Claim 3, whose image is a single edge of G, we have V(¢(C)) © Vi). 
Similarly V(¢(—C)) € V_(z), hence V(¢(C)) NV (¢(—C)) = 0 as claimed. 

Now it is easy to complete the proof. Since the closed curves ¢(C) and ¢(—C) 
are disjoint, and they are very close to the closed curves w(C) and w(—C) with an 
odd linking number, it follows that they have an odd linking number. This implies 
that they contain cycles A C ¢(C) and B C ¢(—C) with an odd linking number. 

If neither F’ nor —F' has a broken edge, then A and B are two linked cycles in 
G, which contradicts the assumption that G is flatly embedded. 

By Claim 3, this leaves us with the alternative that F (say) contains a single 
broken edge vw, and —F' contains none. Let i = ¢(v) and j = ¢(w), and let z be 
an interior point of the edge vw. Then V(A) C V(¢(C)) C V;(z), and similarly 
V(B) C V_(z). Since we are in the exceptional case (ii) of the Nodal Theorem, 
S = V_(z) induces a connected subgraph containing B, and A is a subgraph of 
G[V \S]+i7. By the construction of ¢(vw), these two cycles cannot be linked, a 
contradiction again. 


16.4. Engaged representations 


The Colin de Verdiére number can be characterized in terms of 
a geometric representation that is similar to orthogonal representations 


Kotlov—Lovasz—Vempala 1997|. This will lead to the proof of Theorem |16.16 
A vector-labeling u: V — R¢ of a graph G will be called 1-engaged, if 


(16.10) u;u; {= : : cae 

<1 ifwek#. 
We impose no condition for i = Jj; if we have, in addition, that |u;| > 0 for every 
node 7, then we call the 1-engaged representation strong. 

It is useful to compare l-engaged representations with orthogonal represen- 
tations. We can think of orthogonal representations as an attempt to write the 
adjacency matrix as a Gram matrix, sticking to the 0’s in nonadjacent positions 
(but allowing arbitrary entries in the diagonal and in adjacent positions). The 
above definition can be thought of as trying to write the adjacency matrix as a 
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Gram matrix, where the 1’s in adjacent positions are required, the diagonal is ar- 
bitrary, and the 0’s in nonadjacent positions are relaxed to be anything less than 
1. 


Example 16.28 (Planar triangulations). Recall from Section [5.5.3] that every 
planar triangulation with no separating 3- and 4-cycles has a representation by 
orthogonal proper caps on the sphere. Each cap has a unique pole (the point in 
space from which the part of the sphere you see is exactly the given cap). An 
important observation is that two circles are orthogonal if and only if the corre- 
sponding poles have inner product 1 (Figure[16.7). This translates a representation 


with orthogonal circles into a strong 1-engaged representation. 4 
C; U; 
G 4j 


It 


FIGURE 16.7. Poles of circles. If the two circles are orthogonal, 
then the pole of each circle is contained in the plane of the other, 
and the poles have inner product 1. 


Example 16.29 (Edge-transitive polytopes). Let P be a polytope in R¢ whose 
congruence group is transitive on its edges. We may assume that 0 is its center of 
gravity. Then there is a real number c such that u'v = c for the endpoints of every 
edge. We will assume that c > 0 (this excludes only very simple cases, see Exercise 
10.16). Then we may also assume that c = 1. 

We have to verify that u'v < 1 for any two nonadjacent vertices. Suppose 
not, then 1 < u'v < |u||v|, and so we may assume that |u| > 1. Let w be the 
vertex of P maximizing the linear function u'x. If w 4 u, then there is a path on 
the skeleton of P from u to w along which the linear functional u'x is monotone 
increasing. Let z be the first vertex on this path after u, then u'z > ulu > 1, 
contradicting the fact that uz is an edge of P. 

So we know that u'w < u'u for every vertex w. There is a path on the skeleton 
of P from v to u along which the linear functional u'x is monotone increasing. Let 
y be the last vertex on this path before u, then u'y > u'v > 1, a contradiction 
again since uy is an edge. 

So we know that u'v = 1 for the endpoints of every edge, and u'v < 1 for two 
nonadjacent vertices. Thus the vertices of the polytope provide a strong l-engaged 
representation of its skeleton. 4 


The next lemma implies that if the 1-engaged representation is strong, then the 
representing vectors are the vertices of a convex polytope, and the edges are edges 
of this polytope (perhaps not every edge). We only state this lemma for twin-free 


graphs; see |Kotlov—Lovasz—Vempala 1997] for a version covering the general case. 
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Lemma 16.30. Let G be a twin-free graph without isolated nodes. Letu: V > R4 
be a 1-engaged representation of G. Let P be the convex hull of the vectors u; (i € V) 
and the origin 0, and let P’ be the union of all faces of P not containing 0. For 
i€V, define Q; = {x € P: u]x = 1}. 

a) 

b) If |u;| > 1, then u; is a vertex of P. 

c) Ifij € E, then either u; or u; is a vertex of P. If both of them are vertices 
of P, then the segment connecting u; and uj; is an edge of P. 


The vectors u; are nonzero and different from each other. 


d) If u; is not a vertex of P, then Q; is a face of P, whose vertices are exactly 
the vectors representing the neighbors of %. 


e) Q5 FQ: for j Fi. 
Proof. (a) The assumption that G has no isolated node implies that for every 
node i there is a node j with u}u; = 1, and hence u; 4 0. Since (16.10) determines 


the neighbors of a node 7 once the vector u; is given, the assumption that G is 
twin-free implies that no two nodes are represented by the same vector. 


(b) This follows from the observation that the plane defined by u]x = 1 sepa- 
rates u; from every other u,; and also from 0. 


(c) We have u/u, = 1, which implies that one of u; and u; is longer than 1, and 
hence it is a vertex of P by (b). Let, say, ju;| > 1. If both u; and u; are vertices 
of P, but [u;,u,] is not an edge of P, then some point au;+(1—a)u; (0 < a <1) 
can be written as a convex combination of other vertices: 


au;+(l—a ju; = = S- AKUk, 
kAi,j 


where Ay > 0 and 5°, Ax = 1. Multiplying by uj, we get 


au) u;+ +(1- a)u; uj = So Acujur < SOAR <1. 


kAi,j k#i,j 
On the other hand, 


au, u;+(1—a)uju; = a(juj?—-1)+1 > 1, 
a contradiction. This proves (c). 


(d) By (a), we have |u;| < 1, and so uJu; < 1 for every vertex of P. Hence the 
hyperplane {x : u]x = 1} supports P, and it touches P at a face Q;. Condition 
implies that among the nodes of G different from 7, all neighbors of 2, 
and only those, are contained in Q;. It follows by (b) that every neighbor of 7 is 
represented by a vertex of P, and hence, by a vertex of Q;. The vector u; may 
or may not belong to Q;, but it is not a vertex of it (since it is different from its 


neighbors). 


(e) If Q; = Q, then 7 and j are twins. 


Corollary 16.31. If a twin-free graph G has a1-engaged representation in R¢, then 
it is isomorphic with a subgraph of the skeleton of a d-polytope. If the 1-engaged 
representation is strong, then the convex hull of the representing vectors can serve 
as this polytope. 
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Proof. Let ¢ > 0 be a sufficiently small number. For every node i with |u;| < 1, 
let ui, be a point outside P, at a distance e from Q;, and at distance e? from P. For 
nodes 7 with |u,;| < 1, let uw; = u;. Then every vector ui, is a vertex of the polytope 
Q =conv(u'(V)), and every edge of G corresponds to an edge of Q. 


The assertion of the theorem is not really interesting for d > 3, since every 
graph can be embedded into the 1-skeleton of a 4-polytope. However, the explicit 
construction of this embedding may be useful. 


16.4.1. Transversality of 1-engaged representations. We need the con- 
dition of transversality for 1-engaged representations. Applying the general defini- 
tion, a l-engaged representation is transversal, if the hypersurfaces in R@ defined 
by the equations in (corresponding to the edges of G) intersect transversally 
at u. This leads to a very similar condition as for orthogonal representations: the 
representation u is transversal if and only if (G,u) carries no homogeneous stress, 
ie., there is no nonzero G-matrix X with zero diagonal such that >> j Xu; = 0 
for every node 7. (In the case of orthogonal representations, the equations in the 
definition correspond to the edges, so there we excluded homogeneous stress on the 
complement of G.) 

As before, it is useful to find some reformulations of transversality. Suppose 
that (G, u) carries a homogeneous stress X. Multiplying the equation 5> j Aiguy =0 


by u;, we get 
> Xiygujui = S- Xij = 0, 
j J 


and this in turn gives that >), X;j(uj—u;) = 0. So a homogeneous stress on a 
l-engaged representation is automatically a stress. 

The converse of this statement is almost, but not quite, true. If we have a 
stress X on (G,u), then multiplying the equation yy Xj;(u;—u;) = 0 by uj, we 


get 
Yo Xij(uy — us) "a = oS Xis) (u;|?—1) = 0. 


This means that >); Xij = 0 holds whenever |u;| # 1, and in this case it follows 
that )), Xijuj = 0 also holds. 

These considerations show that every strong 1-engaged representation of a twin- 
free graph in dimension 8 is transversal: Lemma|I6.30] implies that as a framework 
the graph is a subgraph of the skeleton of a 3-polytope, by Cauchy’s Theorem [14.5] 
this representation has no stress, and this implies that it has no homogeneous stress. 


16.4.2. The Colin de Verdiére number and l-engaged representa- 
tions. We denote by y(G) the smallest dimension in which G has a transversal 
l-engaged representation. This quantity is closely related to the Colin de Verdiére 
number. 


Theorem 16.32. For every graph G with at least 3 nodes, we have y(G) = n—1— 
u(G). 


While this theorem establishes a useful connection between a graph and its 
complement, it is not really deep (and not a real duality): the quantity ~(G) is 
defined as a maximum, 7(G) is defined as a minimum. So the statement of the 


lemma has the form minimum = minimum. 
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Proof. First, let k = u(G), and let M be a Colin de Verdiére matrix of G. 
Then M is a well-signed G-matrix with exactly one negative eigenvalue. Let Ay < 
O = rA2 = +++ = Angi < Anzo < ... be the eigenvalues of M. By scaling, we may 
assume that Ay = —1. Let v, be the eigenvector of unit length belonging to )j,. 
Since M is well-signed, we may assume that all entries of v; are positive. We form 
the diagonal matrix D = diag(v1). 

We start with getting rid of the negative eigenvalue, and consider the matrix 


M' = D"\(M+vivj{)D7' = D"'MD'+4. 
This matrix is symmetric and positive semidefinite, and so we can write it as a 


Gram matrix M’ = Gram(u), where u: V — R”~". From the fact that M is a 
well-signed G-matrix, it follows that 


=1 forijeE, 


(16.11) uju,; = M!, jee 
T)<1 forij cE. 


(The somewhat artificial scaling above has been introduced to guarantee this.) The 
diagonal entries of IM’ are positive, but they can be larger than, equal to, or smaller 
than 1. 

Thus we get a vector-labeling u in R"~*~! satisfying (16.11). We claim that 
(G, u) carries no homogeneous stress. Suppose that X is such a homogeneous stress. 
We have seen that this implies that )/, Xi; = 0. In other words, JX = 0. Let 
Y 20 XD-*, then 


MY = D(M'-J)DD"'XD"' = DM'XD™ =0. 


Since Y 4 0 and has the same 0 entries as X, this contradicts the assumption that 
M is transversal. 

The converse construction is straightforward. Let i++ u; € R@ be a 1-engaged 
representation that carries no homogeneous G-stress, let A = Gram(u), and let 
M = AA'-—J. It is clear that M is a well-signed G-matrix with at most one 
negative eigenvalue, and has corank at least n—d—1. 

The only nontrivial part of the proof is to show that M has the Strong Arnold 
Property. Suppose that MX = 0 for a nonzero G-matrix X with zero diagonal. 
Then AA'X = JX, and hence rk(A'X) = rk(X'AA'X) = rk(X'JX) < rk(J) = 
1. (We have used the fact that rk(B) = rk(B'B) for every real matrix B.) If 
rk(A'X) = 0, then A'X = 0, and so X is a homogeneous stress on (G,u), which 
contradicts the hypothesis. So we must have rk(A'X) = 1, and we can write 
A'X = uv', where u € R¢@ and v € RY are nonzero vectors. Hence Auv! = 
AA'X = JX =1(X1)"', which implies that (rescaling u and v if necessary) Au = 1 
and v = X1. Hence v = X Au = vu'u, and so |u| = 1. It follows that 


(A—1'u)(A—1'u)' = AA'- J =M, 


and so M is positive semidefinite. But then its smallest eigenvalue is 0, which has 
multiplicity at least n—d—1 > 1, contradicting the Perron—Frobenius Theorem. 


16.4.3. 1-engaged representations in low dimension. The following the- 
orem is a reformulation of Theorem [16.16 


Theorem 16.33. Let G be a simple twin-free graph. 
(a) If G is outerplanar, then y(G) < 3; if G is not outerplanar, then (G) > 3; 
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(b) If G is planar, then y(G) < 4; if G is not planar, then y(G) > 4. 


Proof. We describe the proof of part (b); the proof of part (a) uses similar, 
but much simpler arguments, and is not given here. Corollary [16.31] implies the 
second assertion, so what we need to prove is that G is planar, then 7(G) < 4. 

We may assume that G is 3-connected, since adding edges to G can only increase 
4(G) by minor-monotonicity. Let v : V — R® be the cage representation in 
Theorem [5.10] of a planar graph G. For 77 € E, the segment connecting v; and 
v,; touches the unit sphere at a point w. Then v;—w and v;—w are parallel 
vectors, orthogonal to w, and pointing in opposite directions. Their length is 
easily computed by Pythagoras’ Theorem: |v;—w| = /|v;|?—1 and |v;—w| = 


/|v;|? —1. This gives 
viv; = 1+(v;-—w)'(v;-—w) = 1-y |v;|? —14/ |v, |?—1. 


Introducing the vectors 
ae 
= : R4, 
‘ () |vi|? — ) 


we get that u/u; = 1. It follows similarly that if the segment connecting the two 
vectors intersects the sphere, then uju; <i. 

Thus we get a l-engaged representation in R+. This representation is transver- 
sal: any stress on (Gu) would be a stress on (G, v) as well, which is impossible by 
Cauchy’s Theorem [14.5] 


Remark 16.34. Perhaps the most interesting special case is that of a planar but 
not outerplanar graph G. Theorem implies that 7(G) = 3 or y(G) = 4. 
In it is proved that a triangulation of the sphere 
(maximal planar map) with at least 8 nodes has y(G) = 3 if and only if no 3- or 4- 
cycle separates it into two parts with at least two nodes. (The case of non-maximal 
planar graphs is open.) 


16.5. Related representations 


A related representation was studied in |Kang et al. 2011), where the following 


conditions were imposed: 


(16.12) ulu, i Ses 

<1 ifweH#. 
The results are similar to some of those described above: Every outerplanar graph 
has a representation in R® satisfying (16.12), and every planar graph has such a 
representation in R+. There are outerplanar graphs that do not have such a repre- 
sentation in R?, and there are planar graphs that do not have such a representation 


in R?. Estimates on the minimal dimension in terms of other graph parameters 
[Brightwell-Scheinerman 1903} 

We have repeatedly come across representations in which the subgraph induced 
by the nodes placed in certain halfspaces are connected. A halfspace in the following 
discussion is the (open) set {x € R?: a'x > b}, where a € R?@ is a nonzero vector. 
If b = 0, we call the halfspace linear. If we want to emphasize that this is not 
assumed, we call it affine. We say that a representation v: V —> R? of a graph 
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G has the connectivity property with respect to a family of subspaces, if for every 
halfspace in the family, the set v~'(H) induces a connected subgraph of G. We 
allow here that this subgraph is empty; if we insist that it is nonempty, we say that 
the representation has the nonempty connectivity property. 


Example 16.35. The skeleton of a convex polytope in R? has the connectivity 
property with respect to all halfspaces (cf. Appendix|C.1). 4 


Example 16.36. The rubber band representation of a 3-connected planar graph, 
with the nodes of a country nailed to the vertices of a convex polygon, has the 
connectivity property with respect to all halfplanes (Claim 1 in the proof of Theorem 


3.2). 4 


Example 16.37. The nullspace representation defined by a well-signed G-matrix 
M with exactly one negative eigenvalue, has the nonempty connectivity property 
with respect to all linear halfspaces whose boundary hyperplane is spanned by 
representing vectors (Theorem|14.10), and also with respect to all linear halfspaces 
whose boundary contains no representing vectors. 

If the graph is 3-connected planar, then the nullspace representation defined 
by a well-signed G-matrix has the nonempty connectivity property with respect to 


all linear halfspaces (Corollary |14.11). 4 


A general study of this property was initiated by Van der Holst, Laurent and 
Schrijver [van der Holst et al. 1995a). In one version, the authors consider full- 
dimensional representations in R? with the connectivity property with respect to all 
halfspaces. It is not hard to see that not having such a representation is a property 
closed under minors. They give the following (nice and easy) characterization: 


Theorem 16.38. A graph has a full-dimensional representation in R¢ with the 
connectivity property with respect to all halfspaces if and only if it is connected and 
has a Ka41 minor. 


We note that this result does not supply a “good characterization” for 
graphs not having a Kg.; minor (both equivalent properties put this class 
into co-NP). We know from the Robertson—Seymour theory of graph minors 
(see for a survey) that not having a Ka+1 
minor is a property in P, and explicit certificates (structure theorems) for this are 
known for d < 5. To describe an explicit certificate for this property for general d 
is an outstanding open problem. 

To sketch the other version, let us consider full-dimensional representations in 
R? with the nonempty connectivity property with respect to all linear halfspaces. 
No full characterization of graphs admitting such a representation is known, but 


the following is proved in [van der Holst et_al. 1995a): 


Theorem 16.39. A graph has a full-dimensional representation in R° with the 
nonempty connectivity property for all linear halfspaces if and only if it can be 
obtained from planar graphs by taking clique-sums and subgraphs. 


Exercise 16.1. Let f(G) denote the maximum multiplicity of the largest eigen- 
value of any matrix obtained from the adjacency matrix of a graph G, where 1’s 
are replaced by arbitrary positive numbers and the diagonal entries are changed 
arbitrarily. Prove that f(G) is the number of connected components of the graph 
G. 
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Exercise 16.2. The complete graph Ky, is the only graph on n > 3 nodes with 
w=n-l. 


Exercise 16.3. If G has at least one edge, then u(G) = maxy u(H), where H 
ranges over all connected components of G. 


Exercise 16.4. Let M be a Colin de Verdiére matrix of a path P,,, let 7 be an 
eigenvector of M belonging to the negative eigenvalue, and let w: V(Pn) > R be 
a nullspace representation of M. Then the mapping i +> w:/7; gives an embedding 


of G in the line |Lovasz—Schrijver 1999}. 


Exercise 16.5. Let M be a Colin de Verdiére matrix of a 2-connected outerplanar 
graph G, and let w : V — R? be a nullspace representation of M. Then the 
mapping i 4 v; = w?, together with the segments connecting v; and v; for 
ij € E, gives an embedding of G in the plane as a convex polygon with noncrossing 
diagonals. 


Exercise 16.6. Let 7): S? > S? be a local homeomorphism. (a) Prove that 
there is an integer k > 1 such that |y~1(x)| = k for each « € S?. (b) Let G be 
any 3-connected planar graph embedded in S”, then 7)~'(G) is a graph embedded 
in S?, with kn nodes, km edges, and k|F(G)| faces. (c) Use Euler’s Formula to 
prove that k = 1 and y is bijective. 


Exercise 16.7. Let ij be a cut-edge of the graph G. Let A be a G-matrix with 
Aj; # 0, and let u be the nullspace representation belonging to A. Then u; and 
u; are parallel. 


Exercise 16.8. Prove that if a graph is embedded in R* in such a way that every 
pair of disjoint cycles have an even linking number, and we perform a A—Y or 
Y —A transformation, then the new edges can be embedded so that the resulting 
embedded graph has the same property. 


Exercise 16.9. Prove that for every embedding of Kg in R®, the sum of linking 
numbers of disjoint pairs of triangles (10 pairs) is odd. 


Exercise 16.10. All triangles in a flatly embedded simple graph can be simulta- 
neously paneled. 


Exercise 16.11. Let P be a d-polytope with no parallel faces, and let F, and 
Fy be antipodal faces. Prove that there are two parallel supporting hyperplanes 
A, and H2 such that F; = Hin P. Show by an example that this does not hold 
if parallel faces are allowed. 


Exercise 16.12. Prove that if G is the disjoint union of graphs G; and G2, then 
V(G) < min{y(Gi)+|V(G2)|, y(G2) +|V (G1) }- 


Exercise 16.13. A graph is planar if and only if its 1-subdivision has a 1-engaged 
representation in R?. 


Exercise 16.14. Every graph has a subdivision with y < 4. 


Exercise 16.15. If G is a planar map of girth at least 5, then y(G) < 3. 


Exercise 16.16. Prove that a graph G has a full-dimensional representation in 
R¢ with the connectivity property with respect to all linear halfspaces, such that 
the origin is not contained in the convex hull of representing vectors, if and only 
if it has a Kg minor. 


Exercise 16.17. Prove that if a graph G has a full-dimensional representation 
in R¢ in general position with the connectivity property with respect to all linear 
halfspaces, then G is d-connected. 
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Exercise 16.18. Prove the following characterizations of graphs having full- 
dimensional representations in R¢ with the nonempty connectivity property with 
respect to all linear halfspaces, for small values of d: (a) for d = 1: forests; (b) for 
d = 2: series-parallel graphs (graphs with no K4 minors). 


CHAPTER 17 


Metric Representations 


Given a graph, we would like to embed it in a Euclidean space so that the 
length of shortest paths between nodes in the graph (the graph metric) should be 
the same as, or at least close to, the geometric distance of the representing vectors. 
Our goal is to illustrate some of the embedding techniques, and, mainly, the use of 
such embeddings to prove graph-theoretic theorems and to design algorithms. 

We start with discussing a few embedding techniques that preserve distances. 
It is not hard to see, however, that in most nontrivial cases every embedding will 
necessarily have some distortion. For example, the graph metric on the “claw” Ky 3 
cannot be embedded isometrically in a Euclidean space of any dimension. We get 
more general and useful results if we study embeddings where the distances may 
change, but in controlled manner. To emphasize the difference, we will distinguish 
distance preserving (isometric) and distance respecting representations. 

The complete k-graph can be embedded isometrically in a Euclidean space with 
dimension k—1, but not in lower dimensions. There is often a trade-off between 
dimension and distortion. This motivates our concern with the dimension of the 
space in which we represent our graph. 

Most of the methods that have been developed for such embeddings extend to 
more general (often all) finite metric spaces. Recall that a metric space is a set 
V endowed with a distance function dg: VxV — R, such that do(u,v) = 0 
if and only if wu = v, do(v,u) = de(u,v), and do(u,w) < de(u,v)+de(v,w) for all 
u,v, w € V. Sometimes it will be convenient to extend the representability problem 
to semimetric spaces (to spaces (V,d2) where dz: V x V is nonnegative, symmetric 
and satisfies the triangle inequality, but do(u,v) = 0 is allowed for different points 
u,vEeV). 

There is a large literature of embeddings of one metric space into another so 
that distances are preserved (isometric embeddings) or at least not distorted too 
much. These results are often very combinatorial, and have important applica- 
tions to graph theory and combinatorial algorithms; see [Indyk—Matousek 2004], 
[Matousek 2002], for more. 

Here we restrict our interest to embeddings of (finite) metric spaces into basic 
normed spaces. It will be often convenient to use the scaled version of the standard 
ép-norm, namely 


d 
1 1 1/p 
_ =n = |p 
lel» = glen = (5) le ) 
i= 
for « € R@ and p > 1. In the case of p = oo, we have |z|o. = ||2||.. = max; |x;|. 


This version of the norms can be considered as function norms over the (finite) 
probability space {1,...,d} endowed with the uniform measure. With this scaling, 
we have ||x||, < ||x||_ for 1<p<q<oo. 
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An £,-representation of a finite metric space (V,d2) is a mapping u: V > R™ 
such that do(i, 7) = ||u;—u,]||,. We say that (V,d2) is 2,-representable, or shortly 
an £,-metric. Note that in the definition of this representability we could use the 
|.|p>-norm instead of the ||.||)-norm, by scaling the representing vectors by a factor 
of d!/?, 

In our limited treatment of metric embeddings, our main interest will lie in 
one embedding technique, the Bourgain random subset representation, which has 
a variety of graph theoretic applications. It is perhaps natural that we need to do 
some preliminary work discussing some examples of isometric embeddings which 
will be needed later. We also need a fundamental tool, the Johnson—Lindenstrauss 
Lemma, which helps to control the dimension of the embeddings. 

After treating the random subset representation and its applications, we intro- 
duce the fascinating notion of volume-respecting embedding, which can be consid- 
ered as a quantitative version of “general position”. The random subset represen- 
tation method can be extended to this case, but this will be somewhat hard going, 
requiring several facts from high dimensional geometry, which are collected in the 
Appendix. 


17.1. Preserving distances 


17.1.1. Supremum norm. The embedding problem is easy if the supremum 
norm is used in the target space. 


Theorem 17.1 (Fréchet). Any finite metric space on n points can be embedded 
isometrically into (R”, ||-||o0)- 


The theorem is valid for infinite metric spaces as well (when infinite dimensional 
Lo spaces must be allowed for the target space), with a slight modification of the 
proof. We describe the simple proof for the finite case, since its idea will be used 
later on. 


Proof. Let (V,d2) be a finite metric space, and consider the vector labeling 
UH Uy = (do(v,7): 1EV) (vEV). 
By the triangle inequality, 
|| — Uy |loo = max |do(u, 7) —do(v, z)| < do(u, v). 


On the other hand, the coordinate corresponding to 2 = v gives 


|[Uu— Unlloo 2 |de(u, v) —da(v, v)| = de(u, v). 


17.1.2. Manhattan distance. Another representation with special combi- 
natorial significance is a representation in R” with the ¢,-distance (often called 
Manhattan distance). For a fixed underlying set V, all 2;-semimetrics form a closed 
convex cone: if i++ u; € R™ is an ¢-representation of (V,d2) and its ul € R™ is 
an ¢,-representation of (V,d4), then 


is an £)-representation of adz+a’d) for a,a’ > 0. 


17.1. PRESERVING DISTANCES 325 


A very useful way of representing @;-semimetrics is by Hamming metrics. We 
have a measurable space (X, 4), and a family of measurable subsets of finite measure 
X; C X (i € V+). Define 

do(i, j) = w(X,AX,). 
It is trivial to check that this defines a semimetric. Since we are interested in finite 
metric spaces, we can always assume that the space X is finite: the sets X; partition 
X into a finite number of parts, and we can aggregate each part to a single atom 
(with the same weight). 

Hamming metrics on the same set V form a convex cone: clearly they can 
be scaled by any positive number by scaling the measure jz, and the sum of two 
Hamming metrics dy and dj is again a Hamming metric. To see this, let (V,d2) be 
represented by subsets X; in a measurable space (X, 4), and (V,d5) be represented 
by subsets X/ in a measurable space (X’, ju’), where we may assume that X and X’ 
are disjoint. Then (V,d2 +d‘) is represented by the sets X;UX/ in the measurable 
space (X UX’, wU yp’) (the measure sU ps’ being defined by (wUp’)(S'}) = w(SOX)+ 
uw (SOX")). 

This last remark is superfluous in the light of the following fact (but it is used 
in its proof): 


Proposition 17.2. Hamming semimetrics on a finite set coincide with ¢,- 
semimetrics. 


Proof. Let (V,d2) be represented by subsets X; in a finite measurable space 
(X, 2). Define a vector u; € R* by 
BD) a3 
ee if JE Xj; 
(ui)i = 4 [|X| 
0, otherwise. 


It is straightforward to check that this defines an ¢,-representation of (V,d2). 

Conversely, let u: V — R¢@ be an ¢,-representation of (V,d2). If k = 1, then 
++ [0, u;] is a Hamming representation of (V,d2) in the measurable space (R, A). If 
k > 1, we construct a Hamming representation for each coordinate, and add them 
up as described above. 


Of special interest are the @;-semimetrics (V,dg) defined by a subset S CV, 


- 1, if |{i,j} 9S] =1, 
esta) ={ Li, JH NS| 


0, otherwise, 


which we call a 2-partition distance. (These are often called cut-semimetrics, but I 
do not want to use this term since “cut norm” and “cut distance” are used in this 
book in a different sense.) 2-partition distances and their connection to Manhattan 
metrics are discussed in detail, with many applications from number theory to 


optimization, in the book |Deza—Laurent 1997). 


Proposition 17.3. A finite semimetric space (V,d) is an ¢,-semimetric if and only 
if it can be written as a nonnegative linear combination of 2-partition distances. 


Proof. Every 2-partition distance dg can be represented in the 1-dimensional 
space by the map lg. It follows that every nonnegative linear combination of 2- 
partition distances on the same underlying set is an £;-semimetric. Conversely, 
every ¢;-semimetric is a sum of ¢,-semimetrics representable in R! (just consider 
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the coordinates of any ¢1-representation). So it suffices to consider ¢,-semimetrics 
(V,d) for which there is a representation i++ u; € R such that d(i,7) = |uj—u,|. 
We may assume that V = [n] and uy <--- < Un, then for i < J, 

n—-1 


d(i,j) = So (uesa —Ur)d64,...,k} 


k=1 


expresses d as a nonnegative linear combination of 2-partition distances. 


There are a lot of results relating the ¢;-metric to other classical metrics on 
linear spaces, most notably to £,-metrics. We only need the following fact. 


Proposition 17.4. If a finite metric space (V,d) is €g-representable, then it is 
£,-representable. 


Proof. Let (V,d) be a finite metric space and let u: V — R®@ be an 2- 
representation. We may assume (by trivial scaling) that u(V) C BY. 

By Proposition [17.2] it suffices to represent (V,d) as a Hamming distance. 
Consider the measurable space (B*, 11), where the measure is defined by 


y= f |x|1— iglica & 


(the constant C' will be determined later). It is easy to see that 
Wx | \([-x, x] NY) do(x), 
Sd-1 


where A is the 1-dimensional Lebesgue measure and a is the (d—1)-dimensional 
Lebesgue measure on S@~!. 


Uy 


FIGURE 17.1. Representing an ¢2 metric in @}. 


The sets X; representing the points in V are easy to define: X; is the “Thales 
ball” over the segment [0, u;]. In formula, this means that 


X; = {xe R*: x'(u;—x) > 0}. 


To calculate the measure of X;AXj;, note that for any x € S*~1, the intersection 
[—x, x] (X;A.X;,) is just the segment between the orthogonal projections of u; and 
u, onto the diameter [—x, x] (Figure|17.1). Hence 


A([-x, x] (X;AX;)) — |x"u; —x"u,| — |x"(u;—u,)| = |u;—u,| |x"(uj—u,)°|. 
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Integrating over x € S*~!, the integral of |x'(u;—u,)°| is independent of (u;—u;)° 
& g g j j 


(which is just a unit vector). Hence 


i= A([-x, x] (X;AX;)) do(x) ~ |u;—uy|, 


and the scaling factor C above has been chosen so that w(X;AX,) = |u;— uy]. 


The converse of the last proposition does not hold: there are ¢;-representable 
finite metric spaces that are not ¢)-representable, as shown e.g. by the graph 
distance in the 4-cycle. 


17.2. Respecting distances 


As we have seen, the possibility of a geometric representation that exactly 
reflects the graph distance is limited, and therefore we allow distortion. Let 
F: V, — V2 be a mapping of the metric space (V,,d,) into the metric space 
(V2,d2). We define the distortion of F' as 


do(FWFW) 7 yg SCF FO), 


wv V4 di(u, v) di(u, v) 


Note that to have finite distortion, the map F' must be injective. The distortion 
does not change if all distances in one of the metric spaces are scaled by the same 
factor. So if we are looking for embeddings in Euclidean spaces (more generally, in 
normed linear spaces), then we may scale the vectors, and restrict our attention to 
embeddings that are contractive, that is, satisfying do(F(u), F(v)) < di(u,v) for 
all u,v € Vi. 

We present some basic theorems about embeddings, and give an important 
application in flow theory. 


17.2.1. Dimension reduction. In many cases, we can control the dimension 
of the ambient space based on general principles, and we start with a discussion of 
these. The dimension problem is often easy to handle, due to a fundamental lemma 


Johnson—Lindenstrauss 1984}. 


Lemma 17.5 (Johnson—Lindenstrauss). For every 0 < «<1, every n-point set 
S CR” can be mapped into R@ with d < (80Inn)/e? with distortion at most € in 
the Euclidean distance. 


Proof. Orthogonal projection onto a random d-dimensional subspace L does 
the job. First, let us see what happens to the distance of a fixed pair of points 
x,y € S. Instead of projecting the fixed segment of length |x—y| on a random 
subspace, we can project a random vector of the same length on a fixed subspace. 
Then (C.17) in the Appendix applies, and tells us that with probability at least 
1—4e-* 4/4 the projections x’ and y’ of every fixed pair of points x and x’ satisfy 


1 d '_y’! d 
fe ee S 
l+eVn |Ix—y| n 


It follows that with probability at least 1— (Rdg 4 this inequality holds si- 
multaneously for all pairs x,y € S, and then the distortion of the projection is at 
most (1+¢)? < 1+3¢. Replacing ¢ by ¢/3 and choosing d as in the Theorem, this 
probability is positive, and the lemma follows. 
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Remark 17.6. More careful computation gives a constant of 8 instead of 80. It 
will also be important later that the mapping constructed in the proof is linear, 
and it can be assumed that it is nonsingular when restricted to any d-dimensional 
affine subspace determined by S. 

The Johnson—Lindenstrauss Lemma does not extend to all values of p. To 
be more precise, there are examples of n-point subsets S of R” for which every 
mapping ¢: S > R¢ (d < n°) has a distortion of at least (c?), if the ¢,-distance 
is used in both $ and 4(S) [Brnlanan—Charikar 2005). See for a 


survey of the many extensions and variants of the Johnson—Lindenstrauss Lemma. 


17.2.2. Embedding with small distortion. We describe an important con- 
struction for the embedding of a general metric space into a Euclidean (2) space 


Bourgain 1985], which has low distortion (and other interesting and useful prop- 


erties, as we will see later). 


Theorem 17.7. Every metric space with n points can be embedded in the Euclidean 
space R@ with distortion O(logn) and dimension d = O(log n). 


To motivate the construction, let us recall Fréchet’s Theorem we embed a 
general metric space (V,d) isometrically into 0, by assigning a coordinate to each 
point w € V, and considering the representation 


x: ib (d(w,t): weEV). 


If we want to represent the metric space by Euclidean metric with a reasonably 
small distortion, this construction will not work, since it may happen that all points 
except u and v are at the same distance from u and v, and once we take a sum of 
coordinates instead of the maximum to compute the norm, the contribution from 
u and v will become negligible. The remedy will be to take distances from sets 
rather than from points; it turns out that we need sets with sizes of all orders of 
magnitude, and this is where the logarithmic factor is lost. 

We start with describing a simpler version, which is only good “on the average”. 
For a while, it will be more convenient to work with @; distances instead of Euclidean 
distances. Both of these deviations from our goal will be easy to fix. 

Let (V,d) be a metric space on n elements. We may assume that the diameter 
of the space is at most 1. Let m = [logn], and for every 1 < r < m, choose a 
random subset A, C V, putting every element v € V into A, with probability 
2-". Let d(v, A;) denote the distance of node v from the set A; (if A; =, we set 
d(v,A;) = 1). Consider the vector labeling x: V — R™ defined by 


(17.1) X, = (d(u, Ai),.--,d(u, Am)) 


We call this the random subset representation of (V,d) in &p. 


Lemma 17.8. The random subset representation is contractive with respect to any 
lp-norm. 


Proof. For u,v € V, by the triangle inequality, 
ld(u, Ai) = d(v, Aj)| < d(u, v), 


and hence 


Xu —Xv||p < ||[Xu—Xvlloo < d(u, v). 
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To prove deeper properties of this representation, we have to use that it depends 
on the random choice of the sets A;, and so we have to make probabilistic statements 
about it. It will be convenient to introduce, for any real-valued random variable X, 
the median median(X), defined as a real number such that P(X < median(X)) < 
1/2 and P(X > median(X)) < 1/2. There may be a whole interval of such values, 
in which case we denote by median(X) the midpoint of this interval (just to make 
it well defined). 

It is easy to see that every median of X minimizes E(|X—y|) over y € R 
(assuming the expectation is finite). In other words, E(|X —median(X)|) < E(|X — 
y|) for every y € R. For arandom vector X = (X1,..., Xm), we define median(X) = 
(median(X1),...,median(X;)). 

Our next lemma says that for a fixed node u, the vector x, cannot be too 
concentrated. 


Lemma 17.9. Let u€ V and 0 < 6 < diam(V,d). Define X,. = min(d(u, A,), 6) 
and X = (X,...,Xm). Then 


AIL 


E(|X—median(X)|1) > 


Proof. For any integer r € [m] and real number t € (0,6), the inequality 
X, > t is equivalent to d(u,A,;) > ¢, which in turn means that A,N B(u,t) = 0. 
Hence 
(17.2) P(d(k, A,) > x) = (1—p,)!®@*)! = (2k)-9 1B, 


Define r(x) = |alog|B(k,x)||, then clearly 1 < r(x) < m. Formula (17.2) implies 
that 


1 1 
(17.3) 3k < P(d(k, A,jz)) > 2) < OK 
Hence 
(17.4) Bee aren ee 
4 — — 4 
for every t. 


Whatever the value of median(X,,¢)) is, X,(¢) is on the other side of t with 
probability at least 1/4 by (17.4). Hence P(t € [X,),median(X,))]) > 1/4. The 
coordinate r(t) depends on t, but we get rid of this dependence if we sum over r: 


3 P(t € [X;,, median(X,)]) > 


r=1 


Ble 


Now let T € [0,6] be a uniform random number, then for every choice of x and y, 
P(T € [x,y]) < |z—y|/d, and so 


E(|X —median(X)|1) = S- E(|.X;—median(X,.)|) 


ra]. 


> o 5P(T € [X;, median(X,.)]) > 


r=1 


el oa 


As an easy application of this lmma, we bound the distance of two represen- 
tative points, at least in expectation. 
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Lemma 17.10. The random subspace representation (17.1) satisfies 
d(u, v) 
E(||Xu—Xollp) 2 Sen. 


for every patr of points u,v EV. 


Proof. By the inequality relating different @,-norms, it suffices to prove this 
for p = 1. Define 6 = d(u,v)/2, X, = min(d(u, A,),6), Y; = min(d(v, A,), 6), 
X = (Xj,...,Xm)' and Y = (Yi,...,Ym)'. Since X and Y depend only on how 
A,. behaves in the disjoint open neighborhoods of radius 6 of u and v, respectively, 
these quantities are independent random variables. Fix Y and invoke Lemma|[17.9} 


6 
E(IK—¥lh) > =. 


It is easy to check that 
|d(u, Ai) —d(v, Aa)| 2 |Xi-Yil, 


and so 


E(xu—xvlh) > E(IX—¥]) > 2 = A"). 


4m 8m 


Now we are ready to prove the theorem. 


Proof of Theorem[17.7| Lemma{I7.8|says that the random subset embedding 
is contractive, so it suffices to prove that it does not contract any distance by more 
than a factor of m = O(logn), with large probability. The previous lemma only 
bounds the expectation of these distances. The Law of Large Numbers suggests the 
remedy: the distance ||x,,—x,||1 is the average of m independent bounded random 
variables, and hence it will be close to its expectation with high probability, if m 
is large enough. We can multiply m basically for free. To be more precise, let us 
generate N independent copies x!,...,x of the representation (17.1), and consider 
the mapping y: V > R‘™, defined by 


(17.5) yu= 


Then 


1 
lyu—Yolla < d(u, v) and E(\l¥u —Yoll1) — E(||Xu —X»||1) a Bm eh): 


For every u and v, |[yu—Yol||1 3 E(||Ku—Xoll1) > d(u, v)/(8m) almost surely by the 

Law of Large Numbers, and hence if N is large enough, then with high probability 

d(u, v) 
9m 


(17.6) llyu—Yulla = for all u,v € V. 


So y is a representation in @,; with distortion at most 9m. 

The assertions above extend to every ¢,-distance instead of the ¢;-distance (in 
particular, to the Euclidean distance): first, ||yu—Yyollp < |[vYu-Yolloo < d(u,v) 
just like above, and second, ||yu—Yollp > |[Yu-—Yoll1 > d(u,v)/(9m) with high 
probability for all u,v EV. 


17.2. RESPECTING DISTANCES 331 


We are not done yet, since choosing a large N will result in a representation in a 
very high dimension. For the Euclidean distance (p = 2), we can apply the Johnson— 
Lindenstrauss Lemma [17.5] (in other words, we apply a random projection), to 
reduce the dimension as stated in the Theorem. 


Remark 17.11. The theorem remains valid for all p > 1, but the proof needs 
additional work, since the Johnson—Lindenstrauss Lemma does not hold for all p. 
One can estimate the concentration of ||x,—x,||; using the Chernoff-Hoeffding 
Inequality; the details of this computation are omitted. 


Deep connections of the random subset representation with graph theory were 
described by |Linial-London-Rabinovich 1995). They showed how to construct an 
embedding satisfying the conditions in Theorem [17.7] by a deterministic algorithm, 
and gave the important application to flow theory to be described in Section 
They also showed that Bourgain’s embedding is essentially optimal, in the sense 


that the O(log) factor cannot be improved (see also [Matousek 1997] ): 


Proposition 17.12. Let G be an a-regular graph (a > 3), which is a b-edge- 
expander (b > 0), and let (V,d) be the graph metric of G, then every embedding of 
(V,d2) in a Euclidean space has distortion at least b log(n/2)/loga. 


Proof. Since the graph is a-regular, the number of nodes accessible from a 
given node in at most t steps (t > 1) is bounded by 1+a+a(a—1)+---+a(a— 
1)*-! < a’. So in fewer than log(n/2)/loga steps fewer than half of the nodes can 
be reached, which implies that 


S- d(u,v) > loeiai2) = 


Rees ~ loga 2 


Since the number of edges is an/2, it follows that the graph distance satisfies the 
inequality 


(17.7) a d(u, v) > ie logtt/2) So d d(u, v) 


a loga 
UjvEV 8 uveE 


On the other hand, for the 2-partition distance dg determined by a set S C V, we 
have 


» dg(u,v) = [S| IV\ SI, 


U,vEV 
and by the hypothesis about the expansion of the i 


S© ds(u,v) = ec($,V\$) Se S| IV\ SI, 
uver 


whence 
nm 
S- dg(u,v) < a S- dg(u,v). 
u,vEeV UveE 


Since every ¢-semimetric is a nonnegative linear combination of 2-partition dis- 
tances by Proposition |17.3} it follows that every ¢;-semimetric dj satisfies 


(17.8) S> ay(u,v) < = S> dy(u,v). 


Uu,vEV uvee 
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Comparing (17.7) and (17.8), we see that every /;-metric approximating the graph 
distance must have distortion at least (log(n/2))/loga. By Proposition |17.4} the 
same holds for every €2-metric. 


17.2.3. Multicommodity flows and approximate Max-Flow-Min-Cut. 
The following fundamental result in the theory of multicommodity flows was 
proved in |Leighton—Rao 1998}. Stated in a larger generality, as proved in 
Po. it says the following. Suppose that we have a 
multicommodity flow problem on a graph G. This means that we are given k pairs 
of nodes (s;, ¢;) (¢ = 1,...,), and for each such pair, we are given a demand d; > 0. 
Every edge e of the graph has a capacity ce > 0. We would like to design a flow 
f* from s; to t; of value d; for every 1 <i <k, so that for every edge the capacity 
constraint is satisfied. 

We state the problem more precisely. We need a reference orientation of G; 
let E denote the set of these oriented edges. We want an (s;,t;)-flow f* for every 
1<i<k, with prescribed flow values d;, so that the total flow through an edge 
does not exceed the capacity of the edge: 


k 
(17.9) SIF <ce for alle € E(G) 


i=l 


(this clearly does not depend on which orientation of the edge we consider). 

Let us hasten to point out that the solvability of a multicommodity flow problem 
is just the feasibility of a linear program (we treat the values f(e) as variables). So 
the multicommodity flow problem is polynomial time solvable. We can also apply 
the Farkas Lemma, and derive a necessary and sufficient condition for solvability. If 
you work out the dual, quite likely you will get a condition that is not transparent 
at all; however, there is a very nice form [Shahroki-Matula 1990}, which 
fits particularly well into the topic of this book. 

Consider a semimetric D on V. Let us describe an informal (physical) deriva- 
tion of the conditions. Think of an edge e = uv as a pipe with cross section C¢ 
and length D(e) = D(u,v). Then the total volume of the system is }>, c-D(e). If 
the multicommodity problem is feasible, then every flow f* occupies a volume of at 
least d;D(s;,t;), and hence 


(17.10) Sd: D(si, ti) < > ceD(e). 


We call this inequality the volume condition. This condition, when required for 
every semimetric, is also sufficient: 


Theorem 17.13. Let G be a graph, let s;,t; € V, dj € Ry (@ = 1,...,k), and 
Ce € Ry (e € E). Then there exist (s;,t;)-flows (f’: i € [k]) satisfying the demand 
conditions val(f’) = d; and the capacity constraints if and only if the volume 
condition holds for every semimetric D in V. 


We leave the exact derivation of the necessity of the condition, as well as the 
proof of the converse based on linear programming duality, to the reader. 

The obvious cut conditions obtained by specializing for 2-partition met- 
rics, provide a system of necessary conditions for the problem to be feasible: If the 
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multicommodity flows exist, then for every S C V, we must have 


(17.11) ys eS SS & 


tl(e)ES,hd(e)¢S i: SN{s;,t:}=1 


In the case of one or two commodities (kK < 2), these conditions are necessary 
and sufficient; this is the content of the Max-Flow-Min-Cut Theorem of Ford and 
Fulkerson (kK = 1) and the Gomory—Hu Theorem (k = 2). However, for k > 3, the 
cut conditions are not sufficient any more for the existence of multicommodity flows. 
The theorem of Leighton and Rao asserts that if the cut-conditions are satisfied, 
then relaxing the capacities by a factor of O(logn), the problem becomes feasible. 
The relaxation factor was improved by Linial, London and Rabinovich to O(log k); 
we state the result in this tighter form, but for simplicity of presentation prove the 
original (weaker) form. 


Theorem 17.14. Suppose that for a multicommodity flow problem, the cut condi- 
tions (17.11) are satisfied. Then replacing every edge capacity ce by 20ce logk, the 


problem becomes feasible. 


Proof. Using Theorem |17.13} it suffices to prove that for every semimetric D 
on V, 


(17.12) S 7 dk, D(si, ti) < 20(logn) S~ caD(E). 
a ecH 
The cut conditions are equivalent to the volume conditions (17.10) for 2- 
partition distances, which in turn implies their validity for semimetrics that are 
nonnegative linear combinations of 2-partition distances, which, as we have seen, 
are exactly the ¢;-semimetrics. By Bourgain’s Theorem[17.7] there is an ¢-metric 
D’ such that 
D'(u,v) < D(u,v) < 20(log n)D’(u, v). 
We know that D’ satisfies the semimetric condition (17.10), and hence D satisfies 
the relaxed semimetric conditions (17.12). 


17.3. Respecting the volume 


A very interesting extension of the notion of small distortion was formulated 
in [Feige 1998]. We want an embedding of a metric space in a Euclidean space 
such that is contractive, and at the same time volume respecting up to a given 
size s, which means that every set of at most s nodes spans a simplex whose 
volume is almost as large as possible. Obviously, the last condition above needs an 
explanation, and for this, we will have to define a certain “volume” of an arbitrary 
metric space (a fascinating notion in itself). Respecting the volume can also be 
viewed as a quantitative version of “general position”. We describe an algorithmic 
application of these results to the approximate computation of the bandwidth of a 
graph. 


17.3.1. Dimension reduction for volume. Before defining the volume in 
general metric spaces, we stay for a while with the traditional notion of volume, and 
prove a generalization of the Johnson—Lindenstrauss Lemma{17.5} [Magen 2007]. To 
simplify notation, for a finite set S C R¢ with k elements, we define vol(S) as the 
(k—1)-dimensional volume of conv($). For a vector labeling x: S — R¢ of a 
finite set S with k elements, we denote by vol,(S') the (& —1)-dimensional volume 
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of conv(x(S)). If x is injective, then this is the same as vol(x(S)). Note that vol(S) 
can be zero if S is not affine independent, and similar warning holds for vol,(S). 


Lemma 17.15. For every real number « > 0 and integer k > 2, and for every 
n-point set V C R”, there is a linear map ¢: R” — R¢ with d = O(e-klogn) 
such that for every SCV with |S|=r<k, we have the bounds 


(1—e)"~*vol(S) < vol(¢($)) < vol(S). 


Note that such a mapping is necessarily contractive with respect to the Euclidean 
distance and bijective on V. 


Proof. This lemma can be proved extending the proof of Lemma 17.5} but a 
more elementary proof can be obtained by reducing it to the Johnson—Lindenstrauss 
Lemma. Recall that this lemma provides a linear map ¢ : R” —> R®@ that is 
contractive and almost isometric on V. 

The property that the map is contractive does not imply that (say) areas of 
triangles are not increased, and the property that distances are not decreased by 
much does not imply that areas of triangles are not decreased by much. The 
idea is to embed V into a larger finite set W, so that approximately preserving 
distances between points in W will imply approximate preservation of volumes of 
r-dimensional simplices with vertices in V for all r < k. 

To show how this idea works, let us start with proving the upper bound. For 
every point v € S and every affine space A C R”, let v4 denote the orthogonal 
projection of v onto A. Let U be the set of all points vag(x), where X C V, |X| < k. 
(Note that V C U.) Let 6: R" > R® be a linear map into some dimension d, 
that is contractive on U. 

Consider a particular r-dimensional simplex conv(Y), where r < k and Y € 
Gat Choose any v € Y, and let X = Y \v and A = aff(X). We may assume (for 
notational convenience) that v4 = 0, then (v4) = 0, Aisa ert subspace, and 
v 1 A. Let y denote the orthogonal projection of é(v) onto ¢ A) ewe lt) 
and let h = |v| and h’ = |¢(v)—y| denote the distance of v Ls A and of ¢(v) 
from (A), respectively. 


A (V4)=0° 


FiGurRE 17.2. Adding projections to V, and the image of a simplex 
under the Johnson—Lindenstrauss mapping. 


By induction on r, we may assume that 


(17.13) vol(¢(X)) < vol(X). 
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Linearity of ¢ implies that lin(d(X )) = @(A), and since ¢ is contractive, h’ = 
|d(v) —y| < |d(v)| < |v] =A. Using (17.13), we get 


vol (¢(Y)) = M yol(3(X)) < *ol(X) = vol(Y). 


This proves the upper bound in the lemma. 

Unfortunately, this simple way of extending V is not sufficient to guarantee 
that volumes do not shrink much. We extend U further by considering, for every 
u € V and every affine subspace A spanned by points in V with dim(A) =r <k 
and u ¢ A, a cubic lattice Ly,4 in A of edge-length |Ju—uy|/./r, containing ua, 
and its finite subset Wu.4 = {Zz € Lua: |z| < 3Ju—uy|}. We add to V all of the 
sets Wy, to get a set W. 

We have to construct the contractive map more carefully as above: We apply 
the Johnson—Lindenstrauss Lemma|17.5]to the set W with error bound ¢; = ¢/22 
to get a linear map ¢: R” > R® for the appropriate dimension dp. 

To show that this mapping does not shrink the volumes of simplices of dimen- 
sion at most & too much, consider Y, X, A, v and y as above. The argument for the 
upper bound in the lemma remains valid, but now we can do a similar computation 
for the lower bound as well. Induction on r gives 


(17.14) vol(¢(X)) > (1—e)"!vol(X). 


Linearity of ¢ implies that ¢(Ly,4) is a lattice in (A); not cubic, but the con- 
tractive property of ¢ implies that the images of lattice cubes, which are the basic 
parallelopipeds of @(Ly,4), have diameter at most h. Let C be the lattice cube in 


Ly,a such that y € ¢(C) (Figure|17.3). 


FIGURE 17.3. The lattice added to V, and its image under the 
Johnson—Lindenstrauss mapping. 


Since ly| < |¢(v)| < |v], it follows that for every vertex z of C, we have 
|d(z)| < ly|+|o(z)—y| < 2h. Hence |z| < 3h, and so z € Wy4. By the distance- 
respecting properties of ¢, 


|o(v) — o(2)|? = (1-e1)?|z—v)? = (1-e1)*(la|? +h”), 
and (considering the point —z) 
|o(v) + o(2)|? < |v +a)? = |2)? +h?. 
Expanding and subtracting, we get 


(17.15) 4d(v)'o(z) < (26, —€?)(|z|? +h?) < 2€3(|z|? +h?) < 20€,h?. 
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This linear inequality holds for every vertex ¢(z) of the parallelotope 4(C), so it 
must hold for y € ¢(C) as well. Hence 


ly|? = o(v)'y < 20e,h?. 
Thus 
|O(v) yl? = |9(v)I? - ly? = (1—1)?h? — 20eyh? > (1—22e1)?h? = (1—€)?h?, 


and 


/ 


vol (¢(Y)) = vol(¢(X)) > (1-e)- (1—e)"—vol(X) = (1—«)"vol(Y). 


a 
To bound the dimension in the application of Lemma we have to estimate 
the size of W. Quite elementary computations yield that |W| < (20n)*. Thus 


1 k 
dz < 80 In|W| < 10°— Inn. 
Ey € 


17.3.2. Volume in metric spaces. Let (V,d) be a finite metric space with 
|V| =n. For a tree T on V(T’) = V, we define Ip = [J,,,cxcr) d(u, v), and we 
define the tree-volume of (V,d) by 


voli: (V, d) = min IIp. 


1 
(n—1)! 7 
Note that the tree T that gives here the minimum is also minimizing the total length 
of edges (this follows from the fact that it can be found by the Greedy Algorithm). 

Since we only consider one metric, denoted by d, in this section, we will not 
indicate it in voli;(V, d) and similar quantities to be defined below. We will, however, 
change the underlying set, so we keep V in the notation. The following lemma 
relates the volume of a simplex in an embedding with the tree-volume. 


Lemma 17.16. For every contractive map x: V —> RX of a metric space (V,d), 
volx(V) < volir(V). 


Proof. We prove by induction on n that for any tree T on V, 


1 
volx(V) < 


= a 


For n = 2 this is obvious. Let n > 2, and let 2 € V be a node of degree 1 in T, let 
j be the neighbor of 7 in 7, and let h be the distance of i from the affine hull of 
x(V \i). Then by induction, 


vol,(V) = —“—volx(V \i) < MD vo V\i) 
d(i,j) 1 1 
po” a 


The main result about tree-volume is that the upper bound given in Lemma 
17.16]can be attained, up to a polylogarithmic factor, for all small subsets simul- 
taneously. 
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Theorem 17.17. Every finite metric space (V,d) with n points has a con- 
tractive map x : V — R® (with respect to the Euclidean metric) with d = 
O((log n)* log log n), such that for every 2<k < logn and every S € eas 
I, (S) a > : lee (S) FE 
VOlx 2 OOOO VOltr are 
O(log nv/log fk) ‘ 

Proof. The proof of this theorem is built on the proof of Bourgain’s Theo- 
rem |17.7| but it is of course more involved. It will be convenient to assume that 
all distances in (V,d) are distinct; this can be achieved by an arbitrarily small 
perturbation. 

Instead of powers of 1/2, we define a finer sequence of probabilities for the 
densities of the random sets. We use the following parameters: 


1 1 
m = [(loglogn)logn], p, = 1—-—... 
(2m)é 


It is clear that 0 < p, < 1, and so we can construct a random subset A, C V, 
where every element v € V is put into A, with probability p,, independently of 
each other (1 <r < m). For notational convenience, we assume that V = [n]. For 
a while, we focus on a single set S', say S' = [k]. 

Consider the balls B; = B(j, $d(j,S\j)) for j € S. It is easy to see that these 
sets are disjoint. We may assume that |B,| = minjeg|B,|. 

Just as in the proof of Theorem|[I7.7] we need to show that for an appropriate 
r, the distance d(k, A,) is not too concentrated as a random variable; we need, 
however, a more complicated formulation of this fact than before. We let 6 = 
d(k, S\k)/2. 


Claim 1. There are real numbers x1, x2 € [0,6] with x2 > 214+ 
r €[m] such that the following inequalities hold: 


17.16 = 14+ ——_ 
( ) ° “Toplaan 


= 


am» and an integer 


P(d(k, Ar) < 21) > > P(d(k, Ar) > x2) > = P(d(k, A-) > 6) < =. 


For any integer r € [m] and real number 0 < a < 0, the inequality d(k, A,) > x 
is equivalent to A, B(k,x) = 0. Hence 
1 
(Qm) Beas 


Define r(x) = | (log log n) log |B(k, x)||, then clearly 1 < r(x) < m. Formula (17.17) 
implies that 


(17.17) P(d(k, A,) > 2) = (1—p,)IB@2)l = 


1 1 
.—. 
(17.18) rims P(d(k, Ar(xy) > x) < 


There is a specific value of r that belongs to at least a 1/m fraction of all x values, 
and so there are two values 0 < x1 < v2 < 6 with r(a1) = r(ae) = r and x > 


x1 +6/m. By (17.18), we have 


1 
P(d(k, Ar) > 2) > 
and i 
m 


The following claim is the main step in the proof. 
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Claim 2. There are integers r1,...,Th—-1 € [m] such that the matrix 
1 d(1,A,,) ... d(1,Ar_,) 
t PO Ag) ous OO5Ap,.:4) 
(17.19) Me) 
1) Why Ag) w« db Ag) 
satisfies 
volir(S' 
E(|det(M)|) > — 


To prove this claim, we use induction on k. For k = 1 the assertion is trivial. 


Let k > 1 and S’ = S\k. By induction, we have a choice of r1,...,7~—2 € [m] so 
that 
voler(S”) 
17.2 E(| det(M’)|) > ——_— 
( 7 0) (| de ( )I) = (16m)*-1? 


where M’ is the matrix constructed by for S’. We may assume (rearranging 
the rows if necessary) that det(M’) > 0. 

Consider the numbers 71,22 and r from Claim 1, and define rz,_, = r. Ex- 
panding det(M) by its last column, we get 


k 
det(M = 1)*-Fd(u;, Ar) det(M7*), 


where MJ* is the submatrix of M obtained by deleting row j and the last column 
k. We can write this as 


det(M) = d(k, A,) det(M"’)+Y, 


where Y is the sum of the first k—1 terms above. Here det(M") and d(k, A,) are 
independent random variables, but Y is not independent of either one of them. 

Recall the proof of Theorem [I7.7} there we forced independence by cutting off 
the outliers: we replaced the numbers d(j, A,) by min{6,d(j, A;)}. Here we want 
to do something similar, but just cutting off the tail would not work: while this 
did not increase the distances in the case k = 2, the volumes (determinants) could 
increase or decrease for k > 3. So we have to use a more tricky way to force some 
degree of independence of the distances from various given points. 

Let X = min{d,d(k, A,-)}, and let W be the event that B;1A, 4 @ for all 
j <k-1. This event is independent of X, since X depends only on the choice of 
A,. inside B,, but the event W depends only on how A, is chosen outside By, since 
B,; OB, = 9. Furthermore, if we condition on W, then Y will be independent of X, 
because under this condition d(j, A,) depends only on the choice of A, outside B;, 
for 7 < k, and the random variable det(M7") is independent of the choice of A,. 

Next we show that, conditioning on W, with probability at least 1/(4m), we 
have 


(17.21) | det(M)| > 2 det(M’). 


First, let us condition on W and also on the event Z = (Y > det(M’)(%, +22) /2). 
We know by Claim 1 that d(k,A,) > xq with probability at least 1/(4m); in this 
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case, X > £2 also holds. The random variable X is independent of events W and 
Z, so even under these conditions, we have 


d(k, Ap) > X > a> mt ta yo 


with probability at least 1/(4m). If this occurs, then 


Ly +292 


det(M) = d(k, A,) det(M’)+Y > d(k, A,) det(M’) det(M’) 


Li +22 ; 6 ; 
> (w2- $5) actu ) = 5 |det(M’)). 


Conditioning on W and on 7Z = (Y < det(M’)(a1 +22)/2), we conclude similarly: 
with probability at least 1/(4m), 


Ly +22 


det(M) = d(k, A,) det(M’)+Y < d(k, A,) det(M’) det(M’) 


<(21- 2) det(M’) 2a asta 
2 2m 
This proves (17.21). Hence 


E(|det(M)| | Ww) > ie — | det (M’)| = d(k, S’) voli+(S”) 
4m 2m 


8m2  (16m?)k-1 


voli: (S) 
~~ (16m?) 
To conclude, we use that 


E(| det(MZ)|) = E(|det(M)| | W)P(W)+E(|det(M)| | -W)(1—-P(W)) 
voli, (S') 
Bal =! t (Ma P >2 P ‘ 
> E(|det(an)| | W)POV) > 2 pw) 
To estimate the probability of W, we use that |B;| > |B,|, and so 


P(d(j, A,) > 6) = P(BjN.A, =) < P(BgN A, = 0) = P(d(k, Ay) > 8) < = 
Thus 


POW) a= 
J 
This completes the proof of Claim 2. 


P(d(j,A,) > 6) > 


Nl eR 


k 


Now we are ready to complete the proof. Let Al,..., A be random subsets 
with density p,. We assume that the sets AJ are independently generated for 
j=1,...,N, r=1,...,m. To each point u € V, we associate the row vector 

He = dt Ae FS n, NPS): 
VmN ? ? ‘a ? 9 ri ? 9 


Let S C V, |S| = &, and assume again that S = [k]. Consider the kx (m+1) 
matrix Y with rows yj,...,yz. Let A be the family of kx k submatrices of Y 
containing the first column. By the Binet-Cauchy Formula and elementary 
determinant transformations, we get 


(17.22) voly (5)? = om S det(U)?. 
“UECA 
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By Claim 2, there is a subset U such that 


1 volir(S) 
E(| det(U)|) > : 
((det(O) > Sane omy 
There are, in fact, not only one, but at least N(N—1)---(N—k+1) such sets. 
(The reason why not quite N* is because the integers r1,..., 74 in Claim 2 may 
not be different, but we need to pick different columns to insure probabilistic inde- 
pendence.) Hence 


N(N-1)---(N—k-+41) vokiy(S)? 
Nk 256m9K ° 

As N -— oo, the random variable voly(9)? will get more and more concentrated 
around its expectation, by and the Law of Large Numbers. (The terms 
in are not all independent, but can be partitioned into substantially large 
independent subfamilies; we leave verification of the details to the reader). The 
first factor in tends to 1. So if N is large enough, then 
voler(S) 
20m3* ° 
with probability arbitrarily close to 1. It follows that this inequality holds for all 
sets S € (Y) with high probability. 

The last step needed is to reduce the dimension, using Lemma [17.15] 


(17.23) E(voly($)*) > 


|voly(S)| > 


17.4. Bandwidth 


The theory of volume-respecting representations described above was devel- 
oped in in order to design an efficient approximation algorithm for the 
bandwidth of a graph. This application is described in this section. I believe that 
this deep theory will have further important applications. 


17.4.1. Bandwidth and density. Let G be a simple graph, and let d denote 
the graph-distance in G. The bandwidth wpa(G) of a graph G is defined as the 
smallest integer b such that the nodes of G can be labeled by 1,...,n so that 
|i—j| < b for every edge ij. The name refers to the fact that for this labeling of the 
nodes, all 1’s in the adjacency matrix will be contained in a band of width 2b+1 
around the main diagonal. 

The bandwidth of a graph is NP-hard to compute, or even to approximate 
within a constant factor [Dubey-Feige-Unger 2010}. As an application of volume- 
respecting embeddings, we describe an algorithm that finds a polylogarithmic ap- 
proximation of the bandwidth in polynomial time [Feige 1998]. The ordering of the 
nodes which approximates the bandwidth will be obtained by projecting the rep- 
resentation onto a random line, in a way reminiscent of the Goemans—Williamson 
algorithm in Section [13.3] 

We need some preliminary observations about the bandwidth. It is clear that 
if there is a node of degree D, then wpa(G) > D/2. More generally, if there are 
k nodes of the graph at graph-distance at most ¢ from a node v (not counting v), 
then wpa(G) > k/(2t). Let B(v,t) denote the set of elements at distance at most t 
from v, and define the local density of the graph G by 


Biv, t)|-1 
doc => dioc(G) => sc ee 
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Then 
1 
(17.24) Wbd > 7 Aloc- 


It is not hard to see that equality does not hold here, and the ratio Wpa/dioc can 
be as large as logn (see Example}17.18] below). 
We need the following related quantity, which we call the harmonic density: 


(17.25) dnar = dhar(G) = 7 


The following inequality establishes a relationship between these two density no- 


tions. 
(17.26) dioe < dhar < (1 +1n N)dioc- 


Indeed, for an appropriate t > 0 and ve V, 


|B(v,t)|-1 i. 1 
dice = ——,-—  & < a XS Ghar- 
; t ps d(u, v) a d(u, v) 


u€ B(v,t)\v 


On the other hand, dpar is the sum of n—1 positive numbers, among which the 
k-th largest is at most dioc/k by the definition of the local density. Hence 
n—-1 n-1 


dhar < », Atos = om ~) dice 


which proves the second inequality in (17.26). 
The following examples show that the above bounds cannot be improved (ex- 
cept for the constants), but also that the bandwidth can be a quite tricky quantity. 


Example 17.18. Consider the path P,, on n nodes. Clearly wya(P,) = 1. Since 
|B(v,t)| < 2t+1, and equality holds for t < n/2, we have dioc(P,) = 2. Thus 
inequality (17.24) is sharp for paths. Furthermore, for odd n and the midpoint v 
of the path, we have 
1 (n—1)/2 1 
dnar = — =2 —~ 2Inn, 
h = d(u,v) >> k a 
uEeV\v k=1 
showing that the upper bound in (17.26) is sharp up to a constant factor. 4 
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FIGURE 17.4. Optimal labeling of a binary tree for bandwidth, 
specified as a labeling and as a matrix. 
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Example 17.19. Let 7; denote the full binary tree of depth r. This tree has 
n = 2"*+'_1 nodes, and if v denotes the root, then |B(v,7r)| =n; it is easy to see 
that this is the choice of v and r defining the local density, so 
n—-1 artl_9 dice = n—-l 27-1 

= and Wbd > = = . 
r 2 2r r 
The bandwidth of [(m—1)(2r)] can indeed be realized [Chung 1997] (see Figure 


and also Exercise|17.8). The harmonic density is given by 


ioc =< 
r 


=) Otel 
dhar = > 2) ra See : 
nl 
This shows that the lower bound in (17.26) is asymptotically sharp. 4 


We conclude this preparation for the main result with a couple of inequalities 
relating the harmonic density to the tree-volume. Specializing from the previous 
section, we recall the notation Ip = |],,cm7) 4(u, v) for a tree T on some nodes 
of the graph G (not necessarily a subgraph of G). 


Lemma 17.20. Let G be a graph on n nodes and T, a tree on k nodes. Then 


< ndhar (G)F-} : 


b:V(T)SV Ter) 


Proof. This is clear for k = 1. Let k > 1. Select any leaf v of T’, denote its 
neighbor by u, and set 7’ = T\v and ¢' = ¢|y7). Then 


1 1 1 
» os os Tyr d(a, d'(u)) = > dhar- 


7 Tyr 


From this the lemma follows by induction. 


As an application of this bound, we prove: 


Lemma 17.21. For every graph G on n points, 


S- a < (k—1)!n (4dnar)®—?. 


Se(X) 


Proof. We use the elementary fact that the number of isomorphism types of 
trees on k nodes is bounded by 4*~1 (see e.g. [Lovasz 1991], Problem 4.18). So 
summing the bound in Lemma|17.20] over all isomorphism types of trees, we get 


> > as < nlAdua.)” *; 


T pviPov PF) 


and hence 
1 (k—1)! 1 
——aan = (k-1)! 
Se(X) Se(X) V(T)=S 


< (k—1)!n (4dpar)*7?. 
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17.4.2. Approximating the bandwidth. Now we come to the main theo- 
rem describing the algorithm to get an approximation of the bandwidth of a graph. 


Theorem 17.22. Let G be a graph on n nodes, let n > 1, let k = [Inn], and let 
x: VR” be a contractive vector labeling such that volx(S) > 7'~Fvolix(S) for 
every k-set S C V. Project the representing points onto a random line L through 
the origin, and label the nodes by 1,...,n according to the order of their projections 
on the line. Then with high probability, |i—j| < 480ndnar logn for every edge ij. 


Proof. Let y; be the projection of x; on L. Let ij be an edge; since x is 
contractive we have |x;—x,| < 1, and hence by it follows that with high 
probability, |y;—y,| < 2|x;—x,|//m < 2/\/m for all edges ij. We will assume 
below that this is indeed the case. 

We call a set S C V squeezed, if diam(y(S)) < 2/\/m. So every edge is 
squeezed; but we are interested in squeezed k-sets, which we count in two different 
ways. Let X be the number of squeezed k-sets. 

By (with m in place of n, kK—1 in place of d, and 2/,/m in place of S), 
we have for every k-set S, 

. (m—k+1)12_ytm_—K41 
(17.27) P(S' is squeezed) < mF /2q,, Tyo, (8) < ex 


Using Lemma }17.21| we can bound the expected number of squeezed k-sets. 
Since n < 3* by the choice of k, we get 


40n \k-1 1 
E(X) < (—) k a 
(X)s k-1 2 volir(S) 
Se(x) 
4 k-1 
< (—) (k—1)!n(Adnar)*? < (240ndpar)*7?. 
On the other hand, let ij € E (i < j). Every k-subset of {i,i+1,...,7} is squeezed, 


and so ‘ 4) 
jo-tt+ =" 
xX> 
> ( , )> : 


(the last step is valid for 7-7 > k—1, which we may assume), and comparing with 
the upper bound, we get 


€((2=2)°*) < 10nd) 


Using Markov’s Inequality and taking (k —1)-st roots, we get with high probability 
that 


j-t < 480ndhark. 


As a consequence, we get that dj, approximates Wpq up to a polylogarithmic 
factor: 


Corollary 17.23. For every graph with n points, its bandwidth and local density 


satisfy 
dice < Wha < O((log n)?y/log log Nn) dioc- 


Remark 17.24. We can generalize the notion of bandwidth to any finite metric 
space (V,d): the parameter Whg = Wpa(V,d) is the smallest real number a such that 
there is a bijection f : V — [n] such that |f(¢)—f(/)| < ad(i,7) for all i,7 € V. 
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It takes a minute to realize that this notion does generalize graph bandwidth: the 
definition of bandwidth requires this condition only for the case when 77 is an edge, 
but this already implies that it holds for every pair of nodes due to the definition 


of graph distance. (We do loose the motivation for the word “bandwidth”.) 


Most of what we proved in this section extends to the more general setting of 


finite metric spaces; see Exercises |17.11}}17.13 


Exercise 17.1. Let (V,d) be a metric space. Prove that (V, Vd) is a metric space. 
Exercise 17.2. The graph metric C4 is ¢;-embeddable, but not ¢2-embeddable. 


Exercise 17.3. The graph K2,3 is not ¢;-embeddable. 
Exercise 17.4. A tree-metric is a graph metric defined by an edge-weighted tree. 
Prove that every tree-metric is £;-embeddable. 
Exercise 17.5. A metric space (V,d) is called hypermetric, if for every k > 1, 
every 2k+1 points pi1,...,Dke+1,91,---;Qk € V satisfy 

YS dipups)+ SS dias S> d(pi.g). 


1<i<j<k+1 1<i<j<k 1<i<k4+1 
1<j<k 


Prove that every ¢;-embeddable metric is a hypermetric. 

Exercise 17.6. Prove that every finite metric space (V,d) with n points has a 
contractive vector-labeling x such that volx(V) > 2!~"/?voli:(V). 

Exercise 17.7. Let (V,d) be a finite metric space and v € V. Prove that (V,d) 
is isometrically embeddable in a Euclidean space if and only if the (V\v) x (V \v) 
matrix A defined by Ai; = d(v, i)? +d(v, 7)? —d(i, 7)? is positive semidefinite. 
Exercise 17.8. Prove that the bandwidth of the complete k-ary tree of depth r 
is [(k"t* —k)/(2r(k—1))]. 

Exercise 17.9. Let T be a spanning tree on a matrix space (V, d), and let i,7 € V, 


and let T1,...,7% be the spanning trees that can be obtained from T by adding 
the edge 77 and deleting one of the other edges of the arising cycle. Prove that 


il 1 1 
MT) ~ ay) tT hy 


Exercise 17.10. For every spanning tree T on a matrix space (V,d), we have 


1 1 
ae a 
mn <> me) 
where P ranges over all spanning paths on V. 
Exercise 17.11. Show how to construct for every finite metric space (V,d) an- 
other finite metric space (V,d’) such that 1 < d’(i,7) < n and Wpa(V,d) = 
Wa (V, d'). 


Exercise 17.12. Define the notions of local and harmonic densities for every 


finite metric space, and prove that inequalities (17.24) and (17.26), and Lemmas 
17.20) and [17.21}remain valid. 


Exercise 17.13. Generalize Theorem |17.22|to finite metric spaces. 


CHAPTER 18 


Matching and Covering in Frameworks 


In this chapter we study vector labelings from their purely linear algebraic 
perspective—where only linear dependence and independence of the underlying 
vectors play a role. Replacing “cardinality” by “rank” in an appropriate way, we 
get generalizations of ordinary graph-theoretic parameters and properties that are 
often quite meaningful and relevant. Furthermore, these results about frameworks 
can be applied to purely graph-theoretic problems, where the vector-labeling is used 
to encode information about other parts, or additional structure, of the graph. 

We restrict our attention to the study of two basic notions in graph theory: 
matchings (sets of pairwise disjoint edges) and node-covers (sets of nodes that 
meet all edges). For a graph G, we denote by v(G) its matching number, the maxi- 
mum cardinality of a matching, and by 7(G) the node-cover number, the minimum 
cardinality of a node-cover. It is clear that to cover the edges of a matching we 
need to use different nodes, and hence 


(18.1) v(G) < 7(G). 


While these problems are in a sense dual to each other, their complexity is quite 
different even for ordinary graphs: the matching problem is polynomially solvable, 
while the node-cover problem is NP-hard. 

The situation is similar for frameworks (G,v) where v: V — R¢. To avoid 
some trivial complications, let us assume that the graph has no isolated nodes 
and the node positions (vector labels) generate the whole space R¢. For S C V, 
we define v(S) = {v; : i © S} and (whenever there is only one vector labeling 
around) rk($') = rk(v(S)). The node-cover number 7(G,v) of a framework is the 
minimum rank of a node-cover. A matching in (G,v) is a set of node-disjoint 
edges such that all their endnodes are labeled by linearly independent vectors, i.e., 
rk(V(M)) = 2|M|. We denote by v(G,v) the maximum cardinality of a matching. 

Ordinary graphs can be considered as the special case when nodes are labeled 
by linearly independent vectors. We denote such a vector labeling by Ugee (since 
we are only concerned with the linear independence structure of the vector labels, 
it does not matter which linearly independent set of vectors we use). It is easy to 
check that 7(G, U free) = T(G) and v(G, Ufc) = v(G). 

The extension of inequality to frameworks remains valid: 


(18.2) (G,v) <7(G,v). 


In this chapter, we formulate extensions of basic characterization results like 
Konig’s and Tutte’s matching theorems to frameworks. Several of these results 
are known from matroid theory, and have many applications in graph theory and 
other combinatorial problems. The node-cover number 7 is NP-hard even in the 
case of ordinary graphs, so no similarly complete results can be expected; however, 
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the geometric methods providing partial results will be important in the proof of a 
classification theorem of cover-critical ordinary graphs (whose statement does not 
involve vector labelings). 

While the results on matchings are more complete and perhaps more important, 
we start with a study of covering, since some results developed for covering will be 
useful in the study of matchings. 

Since only the linear structure is used, most of the considerations in this chapter 
could be formulated over any field (or at least over any sufficiently large field). 
Indeed, most elementary arguments would work over any matroid; however, two of 
the main results (Theorems [18.8{b) and 18.15) do make use of the additional linear 
structure provided by the underlying field. 

Let us describe some examples of interesting matchings and coverings in frame- 
works. 


Example 18.1 (Branchings). Let G be a directed graph; we define an (unori- 
ented) framework (H, v) as follows. For every edge e = ij € E, we create two nodes 
he and t, of H, and connected them by an (unoriented) edge €. We label h. by the 
unit vector v(he) = e,, and t. by the difference v(t.) = ej+n—€itn- 

In the framework (H,v), edges €1,...,é€, form a matching if and only if no 
two of the edges e1,...,e% enters the same node of G (equivalently, the vectors 
v(he) are linearly independent) and they form a forest in G in the unoriented sense 
(equivalently, the vectors v(t.) are linearly independent). Such a set of edges is 
called a branching. It follows that v(H,v) < n—1; if equality holds for some set 
of edges, then they form a spanning branching or arborescence: a spanning tree (in 
the undirected sense) which has a root r so the branching contains a directed path 
from the root to every other node. So every matching with n—1 edges in (H,v) 
corresponds to a spanning branching in G, and vice versa. 4 


18.1. Cover-critical ordinary graphs 


We start with recalling some basic facts about ordinary graphs and their node- 
cover numbers. A graph G is called cover-critical, if deleting any edge from it, 
T(G) decreases. This is equivalent to saying that deleting any edge, a(G) increases, 
and for this reason these graphs are often called a-critical. Since isolated nodes 
play no role here, we will assume that cover-critical graphs have no isolated nodes. 
These graphs have an interesting theory, initiated in [Erdés—Gallai 1961). See also 

There are some easy but very useful observations we can make. In a cover- 
critical graph G every node v is contained in a minimum node-cover. Indeed, if we 
delete v, the rest of the graph can be covered by 7(G)—1 nodes; adding v to this 
set, we get a minimum node-cover containing v. It is just slightly more complicated 
to see that every node is missed by some minimum node-cover. 

For a cover-critical graph G, the following quantity, called its covering defect, 
will play a crucial role: 6(G) = 27(G) —n = n—2a(G). The next theorem summa- 
rizes some of the basic properties of cover-critical graphs. 


Theorem 18.2. For every cover-critical graph G, 
(a) [Erdés—Gallai 1961] 6(G) > 0; in other words, n < 2r(G). 


(b) [Erd3s—Hajnal-Moon 1964] m < (7'9)*?). 
(c) [Hajnal 1965] Every node has degree at most 6(G) +1. 
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FIGURE 18.1. Some cover-critical graphs 


These results will follow easily from more general considerations concerning 
cover-critical frameworks. 

Cover-critical graphs can be classified by their defect. Since a graph is cover- 
critical if and only if each of its connected components are cover-critical, we may 
restrict our attention to connected graphs. The only connected cover-critical graph 
with 6 = 0 is Ky [Erdés—Gallai 1961], and the only connected cover-critical graphs 
with 6 = 1 are odd cycles (this follows from Hajnal’s Theorem [18.2{c)). 

The fact that all odd cycles are cover-critical with the same defect motivates 
the following simple observation: if G’ is obtained from G by subdividing an edge 
by two nodes, then G’ is cover-critical if and only if G is, and 6(G) = 6(G’). By 
this fact, for 6 > 2 it suffices to describe those cover-critical graphs that do not 
have two adjacent nodes of degree 2. 

There is a slightly less trivial version of this observation: Let us split a node of a 
graph G into two (both having positive degree), and connect both of them to a new 
node. The resulting graph G’ is cover-critical if and only if G is, and 6(G) = d(G’). 
Together with Exercise [18.1] this implies that it is enough to describe connected 
cover-critical graphs with minimum degree at least 3. Let us call these graphs basic 
cover-critical. 

There is only one basic cover-critical graph with 6 = 2, namely Ky 
‘Andrasfai 1967], and four basic cover-critical graphs with 6 = 3, the last four 


graphs in Figure Suranyi 1973]. This motivates the following theorem 
Lovasz_ 1978): 


Theorem 18.3. For every 6 > 2, the number of basic cover-critical graphs with 


d(G) = 0 is finite. 


The proof will show that every basic cover-critical graph has at most 23°” nodes. 
(This bound is probably very rough.) The proof uses a generalization of the notion 
of cover-critical graphs to frameworks, and is postponed to section |18.2.4 


18.2. Covering in frameworks 


18.2.1. Extending the rank function. Our goal is to generalize the notions 
of covers and criticality to frameworks [Lovasz 1977], (Lovasz 1978}. We start with 
extending the notion of rank from sets to vectors. 

Let u: V — R? be a vector labeling of a finite set V. We can think of the rank 
function rk(S) = dim (u(S)) as a function defined on the subsets of V, but also as 
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a function defined on the set of all 0-1 vectors in RY. We extend the definition to 
all vectors w € RY as follows. There is a unique decomposition 


t 
(18.3) w=) ails, a>0, 9,C520---C Sy. 
i=1 
(Here t = 0 if w = 0.) To construct this decomposition, we let S; be the set of 
nodes with maximal w;, and let a, be the difference this maximal w; and the next 
largest entry of w. Then w—aj,1 5, has more maximal entries. We repeat this until 
only zero entries are left. In other words, we choose the level-sets of w for the sets 
Sj. 
Using this decomposition, we define 


(18.4) rk(w) = » ayrk(S;). 


If w is 0-1 valued, then this is just the rank of its support. 

The decomposition is unique, but it does not remain unique if we drop 
the condition that the sets S$; form a chain. In other words, we have many decom- 
positions of the form 

t 
(18.5) w=) alr, a >0. 
i=1 
For such decompositions, we can still assert an inequality: 


Lemma 18.4. For any decomposition of a vector w € RY of the form (18.5), we 
have rk(w) < )°, airk(Z;). 


In other words, among all representations of w in the form (18.5), the value 
>>, aik(Z;) is minimized when the sets T; form a chain. 


Proof. The proof is a straightforward application of a technique called “un- 
crossing”. Suppose that there are two incomparable sets JT; and T;. Let, say, 
a; <a;. Let us remove J; from the family, but add 7;N7; and 7;UT} with co- 
efficient a;, while keeping T; with coefficient a;—a,. (If either one of T; 97; and 
T;UT; was already used in the representation, we add a; to their coefficient. If 
Q; = a;, we remove Tj as well.) Since 


17, + Iz, — Lnnar; + 1rur;; 
this way we get a expression for the same vector w, and since 
this representation is not worse than the one we started with. 
To conclude the proof, it suffices to argue that repeatedly performing this 
uncrossing operation, we end up with a representation where the sets 7; form 


a chain in a finite number of steps. This follows from the observation that the 
quantity 5°, a;|T;|? is strictly increasing at each uncrossing step. 


Corollary 18.5. The extended rank function rk : [0,1])” — R is homogeneous, 
subadditive and convex. 


This corollary can be easily derived from the results of [Edmonds 1970]; this 
form was stated in |Lovasz 1983a]. 
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Proof. It is clear that rk is homogeneous. To prove that it is subadditive, we 
prove that for w,w,,w2: [0,1])” > Rand \ > 0, 


rk(Aw) = Ark(w) (A> 0), rk(wy + we) < rk(w,)+rk(wo). 
The first equation is trivial, since the same sets S; occur in the decomposition (18.3) 
of w and Aw. If we substitute for w; and w2 their representation (18.3), we get 
a decomposition of w;+we of the form (18.5), and so the inequality follows by 


Lemma [18.4 
Finally, convexity is a consequence of homogeneity and subadditivity. 


18.2.2. Multiple covers. A k-cover in an ordinary graph is a weighting w 
of nodes by nonnegative integers such that for every edge, the sum of weights of 
the two endpoints is at least k. For a node-cover T, the indicator function I7 is a 
1-cover (and every minimal 1-cover arises this way). 

It is a bit more complicated to describe 2-covers. For every set TC V, define 
the weighting 

2p = 1rt+ In(wr) . 
(Recall that N(V\T) is the set of all nodes having a neighbor in V\T, so it may 
contain nodes of T as well as nodes of V\T). We claim that 27 is a 2-cover of G 
for every set T C V (Figure [18.2). If both endpoints of an edge are covered by T, 
then this edge is covered twice. If one of them is in 7’, but the other is one not, then 
the first endpoint is in N(V\T), and so it has weight 2 in 27. If neither endpoint 
is contained in T, then both of them are covered by N(V \T). 


sessmeces 


FIGURE 18.2. A set T of nodes and the 2-cover 27 constructed 
from it. 


If T = V, then 27 = ly. The same holds of T = 0, provided G has no 
isolated nodes. This illustrates the important fact that 27 does not depend on T 
in any monotone way. If T is a node-cover, then we have N(V\T) C T, and so the 
support of 2p is T. If T is a minimal node-cover (with respect to inclusion), then 
N(V\T) =T, and so 27 = 217. 

If we allow the trivial operation of increasing some of the weights, then this 
construction will produce all 2-covers. Indeed, for a 2-cover w, let T be its support. 
Then T must be a node-cover, and all nodes in N(V\T) must have weight 2, so 
w > 2r. This implies that w is a 2-cover if and only if there is a set J such that 
w > 2+. If the 2-cover w is minimal (i.e., no node-weight can be decreased and still 
have a 2-cover), then w = 27 for some set T. This set T is by no means unique; 
for example, it can always be chosen from among node-covers. 

One would expect that to describe k-covers for larger values of k becomes 
increasingly complicated, perhaps impossible. But in fact knowing 1-covers and 
2-covers gives us all. 

The sum of a k-cover and an J-cover is, trivially, a (k+1)-cover. A 2-cover is 
not necessarily the sum of two 1-covers (for example, the all-1 weighting of the 
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nodes of a triangle). But every 3-cover is the sum of a 2-cover and a 1-cover; more 
generally, for k > 2 every k-cover w = (w;: 7 € V) can be decomposed into the 
sum of a 2-cover w’ and a (k—2)-cover w” by the following simple formulas: 


2. if Wi = k, 
(18.6) we=41, ifl<w,<k-1, w, = wj-w}. 
0, if Wi = 0, 


It is easy to check that w’ is a 2-cover and w” is a (k—2)-cover. Repeating this 
argument, we can decompose every k-cover as a sum of |&/2| 2-covers and at most 


one 1-cover (Figure |18.3). 
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FIGURE 18.3. Decomposing a 5-cover into two 2-covers and a 1-cover. 


The notion of k-covers is independent of the vector labeling, but the geometry 
enters when we replace “size” by “rank”. Recalling the definition (18.4), we see 
that the rank of the 2-cover defined by a node-cover T is 


(18.7) rk(2r) = rk(f)+rk(N(V\T)). 


In the decomposition (18.6) of a k-cover, every level-set of w’ or w” is a level-set 
of w, and hence we can use the level sets of w to define the ranks of w’ and w” 
(some of them may have coefficient 0). This implies that 


(18.8) rk(w’)+rk(w”) = rk(w). 

The k-cover number 7;,(G, v) for a framework (G, v) is defined as the minimum 
rank of any k-cover. For the (most important) case of k = 1, we set r(G,v) = 
71(G,v). In other words, 7(G,v) is the minimum dimension of a linear subspace 
L CR? such that at least one endpoint of every edge of G is labeled by a vector in 
L. 

Since the sum of a k-cover and an m-cover is a (k+m)-cover, we have 
Th+m(G,Vv) < Th(G,Vv)+7m(G, v); in particular 
(18.9) Th (G,v) < kr(G,v). 

This implies that the difference 

0(G, v) = 27(G, v) —72(G, v) 
is nonnegative. We call this quantity the covering defect of the framework (G, v). 
(This quantity could be described as an “integrality gap” in integer programming; 
cf. Exercise [18.4]) The covering defect is defined for all frameworks, in particular 
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for ordinary graphs, and it is not hard to see that 6(G, ufgee) = 6(G) for all cover- 
critical graphs G. 

The following simple lemma is the key to the study of cover-critical graphs and 
frameworks. 


Lemma 18.6. Let S be any node-cover in a framework (G,v), and let T be a 
node-cover with minimum rank. Then rk(2sur) < rk(2s). 


Proof. If we decompose the 3-cover w = I7+2s by (18.6), we get w = 
w’+w”, where w’ = 2sur is a 2-cover and w” is a l-cover. Using the convexity 
of the rank function and (18.8), we get 


rk(T) +rk(2g) > rk(w) = rk(2sur)+rk(w”). 
Since rk(T) < rk(w”) by the minimality of rk(T), this implies the lemma. 


Applying the lemma repeatedly, we get that if S is a node-cover, and T),...,T 
are node-covers with minimum rank, then 


(18.10) rk(2suT,u---uT; ) < rk(2s). 


This inequality will be most useful when there are many node-covers with minimum 
rank, which will happen in the “cover-critical” case, to be discussed next. 


18.2.3. Cover-critical frameworks. A framework is cover-critical, if delet- 
ing any node or edge, its node-cover number decreases. This condition implies, in 
particular, that the graph has no isolated nodes, and no node is labeled by the zero 
vector. 

If (G, v) is cover-critical, and we project the representing vectors onto a generic 
7(G)-dimensional space, then ranks up to r(G) don’t change, and hence the result- 
ing framework is still cover-critical. Hence we could always assume that the repre- 
sentation is in T(G)-dimensional space. Informally, there is no hyperplane meeting 
every edge, but deleting any edge, there will be. (This reduction is, however, not 
always convenient.) 


FIGURE 18.4. Some cover-critical frameworks. The figures are 
drawn in projective plane, so that 1-dimensional linear subspaces 
are depicted as points, 2-dimensional linear subspaces are depicted 
as lines etc. The reader is recommended to use this way of drawing 
frameworks to follow the arguments in this chapter. 


352 18. MATCHING AND COVERING IN FRAMEWORKS 


A framework (G, Ufree) is cover-critical if and only if G is cover-critical as an 
ordinary graph. This way the theory of cover-critical frameworks generalizes the 
theory of cover-critical graphs. 

Some basic results concerning cover-critical graphs can be generalized to the 
vector-labeled case without difficulty. For example, every node is contained in a 
minimum rank node-cover, and missed by another minimum rank node-cover. 

There are properties that are trivial in the unlabeled case, but need a proof in 
our case. 


Lemma 18.7. In a cover-critical framework (G,v), the neighbors of any node are 
represented by linearly independent vectors. 


Proof. Let b € N(a). By criticality, 7(G\ ab, v) < 7(G,v), and hence there is 
a (7(G, v) —1)-dimensional subspace L that contains one of the endnode positions 
of every edge except ab. Clearly, va,vp ¢ L (otherwise L would be a node-cover 
of G). Hence L must contain v; for every 7 € N(a), 7 # b. It follows that 
vp ¢ lin{v; : j € N(a)\{b}}. This holds for every 6 € N(a), showing that the 
vectors vy (b € N(a)) are linearly independent. 


The last lemma implies that the degree of each node is at most rk(V). We 
are going to prove a much sharper bound on the degrees, as part of the following 
generalization of Theorem [18.2] to frameworks. For a € V, set €g = rk(V) —rk(V \ 
{a}). In other words, ¢, = 1 if vq is linearly independent of the vectors {v; : j 4 a}, 
and €, = 0 otherwise. 


Theorem 18.8. Let (G,v) be a cover-critical framework anda €V. Then 
(a) rk(V) = 72(G,v). 


(b) m< oo), 
(c) deg(a) < 6(G,v)+<éa. 


Proof. (a) follows easily from our considerations about multiple covers. Since 
V is a 2-cover, we have 72(G,v) < rk(V). On the other hand, let T),...,T, 
be all minimum node-covers, and $, a further node-cover. Then, as remarked, 
T,U---UT;, = V, and so by (18.10), we have 


rk(2s) Zz tk(2suru.--uT;, ) — rk(2y) = rk(V). 
This proves that rk(V) = 72(G, v). 

(b) Let (G,v) be a cover-critical framework in dimension d. We may assume 
that d = 7(G,v). Consider the matrices M,; = Viv] +VjV) (ij € EF). We claim 
that these matrices are linearly independent. 

For any edge ab, the graph G\ab has a node-cover T that does not span the 
space R?; let u € R? (u ¥ 0) be orthogonal to the span of T. Clearly u'v, 4 0 


and u'v, # 0, else {7 € V: u'v;} =0 would be a set covering all edges of G of 
rank d—1<7(G). Then 

u'M,ju= ulv;vju+u'vj;vju = 2(u'v,)(u'v;) =0 
for every edge ij 4 ab (since one of ¢ and 7 must be in T), but 


u'M,yu = 2(u'v,)(u'v,) 4 0. 
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This shows that Map is linearly independent of the matrices Mj;, ij A ab. Since ab 
was an arbitrary edge, this implies that the matrices M;; are linearly independent. 
Hence their number cannot be larger than the dimension of the space of symmetric 
dxd matrices, which is em? 


(c) Let T C V\a be a node-cover. We claim that 
(18.11) deg(a) < rk(2r)—rk(V)+€,. 


Applying this to an optimal node-cover not containing a, and using (a), assertion 
(c) of the theorem will follow. We prove this inequality by induction on |V \T]. 
Let S1,..., 5% be all node-covers of minimum rank that do not contain a, and 
let T’ = TUS)U---US;,. By Lemma|I8.6] we have rk(27-) < rk(27). 
If T’ =V\a, then 2p = 1p +1 ya), and so by Lemma|18.7| 


deg(a) = rk(N(a)) = rk(27-) —rk(T") < rk(2r) —rk(V) +e, 


and we are done. So suppose that A = V\T’ 4 {a}, and so B = A\a is nonempty. 
Note that A and B are stable sets of nodes. 


Claim. Let i € N(a). Then i ¢ N(B), and v; is linearly independent of the set 
v(N(A)\ i). 


Indeed, by the cover-critical property of (G,v), the framework (G\ai,v) has 
a node-cover Z with rank 7(G,v)—1. Clearly v, and v; are linearly independent 
of v(Z). The set ZU {i} is a node-cover of G with minimum rank, and hence it is 
one of the sets S;. By the definition of A, it follows that ZN A =. Since Z covers 
all edges except ai, there is no edge connecting i to B, and N(A)\i C Z. So v; is 
linearly independent of v(NV(A)\i). This proves the Claim. 

The Claim implies that rk(N(A)) = rk(N(B)) +deg(a). Let b € B. Applying 
with V \ B in place of T (which we can do by the induction hypothesis), we 
get 


deg(b) < rk(2y\g)—rk(V)+ep = rk(V \ B) +rk(N(B)) —rk(V) +25. 
Since deg(b) > 1 and rk(V\B) < rk(V\A)+1, this implies that rk(N(B)) > 
rk(V) —rk(V \ A) —e», and hence 
deg(a) = rk(N(A)) —rk(.N(B)) < rk(N(A)) +rk(V \ A) —rk(V) +20 
= rk(27-) —rk(V) +e, < rk(2r) —rk(V) +e. 
We are done, unless €¢, = 0 and ¢, = 1 for all b € B. In this latter case vy 
depends linearly on {vj 797 € V\a}, but in this dependence the elements of B 


cannot be involved. So vg depends linearly on {v; : 7 € V\A}. This shows that 
rk(V \ B) = rk(V \ A), and we gain 1 in the above computation, proving (18.11). 


Assertion (a) implies that the set V is an optimal 2-cover, and 
(18.12) rk(V) < 27(G, v). 


If equality holds in (18.12), then part (c) of the theorem implies that deg(a) = 1 and 
Eq = 1 for every node a, and so G is a matching. This characterizes cover-critical 
frameworks with 6 = 0. 

Theorem [18.8{b) generalizes Theorem [18.2{b), giving a good upper bound on 
the number of edges. What about the number of nodes? Since we do not allow 
isolated nodes, a cover-critical framework in R@ can have at most d(d+1) nodes 
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by Theorem b). One cannot say more in general, since from every cover- 
critical framework we can construct another one by splitting every node to nodes 
of degree 1, labeled by the same vector. From the complete (d+1)-graph with the 
free labeling, we get a framework (G,v) with d(d+1) nodes and 7(G, v) = d. 

The next step should be to generalize Theorem|[18.3]to frameworks. However, it 
seems that the structure of cover-critical frameworks can be much more complicated 
than the structure of cover-critical ordinary graphs, and a classification theorem 
seems to be out of reach. But, somewhat surprisingly, Theorem and some 
other results about “ordinary” cover-critical graphs will be proved using vector- 
labelings. 

Before returning to ordinary graphs, we need a couple of technical lemmas, 
nothing deep, but involving some case-distinction. The first one analyzes how 
changing the label of a single node in certain ways influences the cover-critical 
property. Given a linear subspace L C R¢ and a finite set S C R%, we say that a 
point x € L is in general position on L relative to S, if x € lin(.S’) for some S’ C S$ 
only if ZL C lin(S’). It is clear every subspace contains a point in general position 
relative to any finite set. 


Lemma 18.9. Let (G,v) be a framework in R¢, let 7 € V, and construct a new 
labeling ¥ by replacing v; by a vector ¥; in general position on lin(v(N(j))) relative 
to v(V\j) (we keep every other position). Then t(G,¥) > 7(G,v). If, in addition, 
the original label v; is in general position in R¢ relative to v(V \j), then T(G, Vv) = 
7(G,v). If, in addition, (G,v) is cover-critical, then (G,v) is cover-critical as well. 


Proof. Let T be a node-cover of G for which rk(¥(T)) is minimal. If 
rk(v(T)) < rk(¥(T)), then r(G,v) = rk(v(T)) < rk(¥(T)) < 7(G,¥), and we are 
done. So suppose that rk(v(Z')) < rk(v(Z)), then j € T and ¥; € lin(v(T'\j)). 
By the assumption that Vv; is in general position on lin(v(V(j))), it follows that 
lin(v(N(j))) € lin(v('\j)). But then T’ = T\jUN(j) is a node-cover of G with 
rk(v(Z")) =rk(v(T'\j)) < rk(¥(T)) = 7(G, ¥), a contradiction. 

Now suppose that v; is in general position relative to v(V \j), and let T be a 
node-cover of G for which rk(v(T)) is minimal. If j ¢ T, then 7(G,¥) < rk(¥(T)) = 
rk(v(T')) = 7(G,v). Suppose that j € T. By the assumption about v,;, we have 
either v; ¢ lin(v(T'\j)), or lin(v(T\j)) = lin(v(V \j)). In the first case, 


7(G,¥) <rk(¥(T)) < rk(v(T\j)) +1 = rk(v(T)) = 7(G,v). 
In the second, 
7(G,¥v) <rk(v(V\j)) =rk(v(T\j)) < rk(v(T)) = 7(G,v). 


Finally, suppose that G is cover-critical, and let e € E. If e is not incident with 
j, then the same argument as above gives that 7(G'\e,¥) = 7(G\e,v) < 7(G,v) = 
1(G,¥). Ife is incident with j, then a node-cover T of G\e with rk(v(T)) < r(G, v) 
cannot contain 7, and hence 


7(G\e,¥) <rk(¥(T)) =rk(v(T)) < r(G,v) = 7(G,¥). 


The second technical lemma deals with conditions under which deleting a node 
preserves the cover-critical property. 


Lemma 18.10. Let (G,v) be a framework, and let j be a node such that v; € 


lin(v(N(j))). Let v’ be the orthogonal projection of v onto the hyperplane v7: 
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Then 7(G\j,v’) = 7(G,v)—1. If, in addition, (G,v) is cover-critical, then so is 
(G\j,v’). 

Proof. Let T be a node-cover of G\j with rk(v/(T)) = 7(G\j,v’). Then 
TU{j} is a node-cover of G with rk(v(T)) < rk(v/(T)) +1, showing that r(G,v) < 
7(G\j,v’) +1. 

Conversely, let T be a node-cover of G with rk(v(T)) = 7(G,v). The node- 
cover T’ contains either j or N(j). In both cases, vj € lin(v(T)). Since T\j is a 
node-cover in G\j, this implies that 


7(G\j,v') < rk(w/(T\39)) =rk(v'(L)) < rk(v(T)) = 7(G,v). 

Finally, assume that (G,v) is cover-critical, and let e € E(G\ 7). There exists 

a node-cover T of G\e with rk(v(T)) = r(G,v)—1. It follows as above that 
rk(v/(T)) = 7(G,v)—2 = 7(G\j,v’)—1. This shows that (G\j,v’) is cover- 
critical. 


18.2.4. Classifying cover-critical graphs. Combining the previous lemmas 
with Theorem [18.8] we get an important corollary for ordinary graphs, saying that 
a large cover-critical graph with small cover defect is almost bipartite: a minimal 
cover T’ spans a small number of edges only, while its complement is, of course, a 
stable set. 


Corollary 18.11. Let G be a cover-critical graph, and let T C V_ be a node-cover. 
Then 


2 


In particular, if T is a minimum node-cover, then 
d(G)+1 
cary s (™)*"), 


Proof. Let A = V\T. Starting with the framework (G,ufree), we apply the 


constructions in Lemmas and |18.10} we replace v; (i € A) by a point ¥; € 


v(N(i)) in general position, and then delete i and project the remaining labels 
onto v;. Repeating this with all nodes of A, we are left with a cover-critical 


framework (G[T],u) with 7(G[Z],u) = 7(G)—|A|. An application of Theorem 
b) completes the proof. 


We need one further lemma, which is “pure” graph theory. 


\A(G[T))| < ee 


Lemma 18.12. Let G be a bipartite graph, and let S),...,5% GC V be such that 
G\S; has a perfect matching for every i. Assume, furthermore, that deleting any 
edge of G this property does not hold any more. Let s = max; |S;|. Then the number 
of nodes of G with degree at least 3 is at most 2s° (he 


Proof. Suppose that G has more than 2s° () nodes of degree at least 3. Let 
us label every edge e with an index 7 for which G\e\S; has no perfect matching, 
and let M; denote a perfect matching in G\.$;. Clearly all edges labeled i belong 
to M;. 

Let us label every node with degree at least 3 by a triple of the edge-labels 
incident with it. There will be more than 2s? nodes labeled by the same triple, say 
by {1, 2,3}. 
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The set M,U Mp consists of the common edges, of cycles alternating with re- 
spect to both matchings, and of alternating paths ending at S;US. A node labeled 
{1, 2,3} cannot belong to a common edge of M; and Mg (trivially), and it cannot 
belong to an alternating cycle, since then we could replace the edges of M, on 
this cycle by the other edges of the cycle, to get a perfect matching in G\ S; that 
misses an edge labeled i, contrary to the definition of the label. So it must lie on 
one of the alternating paths. The same argument holds for the sets M,U M3 and 
M2U M3. The number of alternating M;-M; paths is at most s, so there are three 
nodes labeled {1,2,3} that lie on the same M;-M; paths for all three choices of 
{i,j} C {1, 2,3}. Two of these nodes, say u and v, belong to the same color class 
of G. 

We need one more combinatorial preparation: the alternating M;-M,; path 
through u and v has a subpath Q,; between u and v, which has even length, and 
therefore starts with an edge labeled 7 and ends with an edge labeled j, or the other 
way around. It is easy to see that (permuting the indices 1,2,3 if necessary) we 
may assume that Qj2 starts at u with label 1, and Qo3 starts at wu with label 2. 

Now traverse Qo3 starting at u until it first hits Qi2 (this happens at v at the 
latest), and return to u on Qj. Since G is bipartite, this cycle is even, and hence 
alternates with respect to Mz. But it contains an edge labeled 2, which leads to a 
contradiction just like above. 


Now we are able to prove the finite basis theorem for cover-critical graphs. 


Proof of Theorem [18.3] Let G be a basic cover-critical graph with 6(G) = 6, 
let T be a minimum node-cover in G, let F' be the set of edges induced by T,, and let 
G’ be obtained by deleting the edges of F from G. By Corollary 18.11} IF] < (47). 

Let Ri,..., Rx be the minimal sets of nodes that cover all edges of F’. Trivially 
|R;| < |F| and k < 2l¥!. Let 7; = 7(G’\ R;); clearly 7; > 7(G)—|R;|. Since G’\ R; 
is bipartite, it has a matching M; of size 7;. Let S; =V\V(M;). (We can think of 
the matching M; as a “certificate” that R; cannot be extended to a node-cover of 
size less than 7(G).) 

By this construction, the graphs G’—S; have perfect matchings. We claim 
that deleting any edge from G’ this does not remain true. Indeed, let e € E(G’), 
then G\e has a node-cover T with |T| < 7(G). Since T covers all edges of F’, it 
contains one of the sets R;. Then T’— R; covers all edges of G’ — R; except e, and 
so T(G’—R;-e) < |T|—|Ri| < 7(G)—|R;| < 7;. Thus G’— R;—e cannot contain 
a matching of size 7;, and hence G’ — S;—e cannot contain a perfect matching. 

We can estimate the size of the sets 5; as follows: 


|S;| =n—27, < n—2(7r(G)—|R,|) = 2|R,|-—6 < 2|F|—6 < &. 
We can invoke Lemma |18.12) and conclude that G’ has at most 2 ee nodes of 


degree at least 3. Putting back the edges of F' can increase this number by at most 
2|F|, and hence 


n< 2(5)8°+26(5-1) 27. 
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18.3. Matchings 


18.3.1. Gallai’s Identity. Let G be a graph without isolated nodes, and 
let p(G) denote the minimum number of edges covering all nodes. A simple but 
important fact in graph theory is Gallai’s Identity: 


(18.13) p(G)+v(G) =n. 


As a warm-up to the topic of matchings in frameworks, we prove a generalization 
of this identity. Let (G,v) be a framework in R?¢, where we assume that the graph 
G has no isolated nodes and the vector labels span the whole space. Recall that 
v(G,v) denotes the maximum cardinality of a matching. Let us call a set of edges 
an edge-cover, if their endpoint labels span the space R?, and let p(G,v) denote 
the minimum cardinality of an edge-cover. We claim that 


(18.14) p(G,v)+v(G,v) =d. 


To prove this, first let M C E bea matching of size v(G,v). The vector labels 
of edges in M span a subspace L of dimension 21(G,v). If this is not the whole 
space, there must be an edge f with at least one endpoint label not in L, so adding 
this edge to M we increase the dimension of L. Repeating this at most d—2v(G,v) 
times, we get an edge-cover. Thus 


p(G,v) < v(G,v) + (d-2v(G,v)) = d—-v(G,v). 


Conversely, let S be an edge-cover of minimum size, and let M C S be a matching 
of maximum size among subsets of S. For every edge e € S'\ M, the set MU {e} is 
not a matching, and hence rk(V(M)UV(e)) < 2|M|+1. Thus 


d=rk(V(S)) =rk(V(M)UV(S\M)) < 2|M|+|S\M| 
=|S|+|M| < p(G,v)+v(G, v). 


This completes the proof of (18.14). 


Example 18.13 (Pinning a framework). Let (G,u) be a bar-and-joint frame- 
work in the plane. Our goal is to pin down some of the nodes so that we prevent 
all infinitesimal motions. Recall the basic equation system (15.4): 

(18.15) (u;—u;)"(vi-vj)=0 (ij € B), 

where the unknowns are the 2n coordinates of the velocities v. A pin at node 2 
adds the equations v;,; = 0 and v;,2 = 0. Note that neither one of these equations is 
automatically fulfilled, since a rigid translation in a non-coordinate direction does 
not satisfy them. 

Recall that Inf C R?*” denotes the linear space of infinitesimal motions. A 
condition vj, = 0 defines a hyperplane in Inf; let f,; be its normal vector in Inf. 
Pinning a set S C V eliminates all infinitesimal motions if and only if the vectors 
fui; (v = 1,2, i € S) generate the whole space Inf. 

Define a graph H on the set W = {1,2}xV by connecting (1,7) and (2,7) 
for every 7 € V, and label node (v,i) by f,;. Without the vector labels this is a 
trivial graph, but the vector labels make it interesting. By our discussion above, 
the minimum number of nodes of G whose pinning fixes the framework is equal 
to p(H, f). By Gallai’s Identity (18.14), this is equal to dim(Inf)—v(H, f). To 
determine the dimension of Inf is a straightforward computation in linear algebra. 
We'll see that v(H, f) is computable in polynomial time, so the pinning number is 
computable in polynomial time. 4 
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18.3.2. Bipartite frameworks. We want to extend the fundamental Mar- 
riage Theorem, which states that for bipartite graphs, equality holds in (18.1). Let 
us define a framework to be bipartite if its node set has a bipartition V = V; UV2 
such that every edge connects Vj to V2, and rk(V) = rk(Vi)+rk(V2). In other 
words, the space has a decomposition R¢ = L;@Lz such that every edge has an 
endpoint labeled by a vector in L, and an endpoint labeled by a vector in Lo. 


Theorem 18.14. If (G,v) is a bipartite framework, then v(G,v) = 7(G,v). 


A proof follows almost immediately from the results of the previous section 
(this proof would extend to the non-representable case easily). 


Proof. We have seen that v(G,v) < 7T(G,v), so it suffices to prove the reverse 
inequality. We may assume that (G,v) is cover-critical (just delete edges and/or 
nodes as long as T(G, v) remains unchanged; it is enough to find a matching of size 
T(G,v) in this reduced graph). Let V = V, UV2 be the bipartition of (G,v), then 
both V, and V2 are node-covers, and hence 

k(V; k(V; k(V 
T(G,v) < min{rk(Vi),rk(V2)} < sl ve (V2) ans : ) 
By (18.12) and the remark after it, this implies that (G,v) is a matching, and so 
v(G,v) =m > 7(G,v). 


18.3.3. General frameworks. Tutte’s Theorem characterizing the existence 
of a perfect matching and Berge’s Formula describing the maximum size of match- 
ings extend to frameworks as well [Lovasz 1980}. Various forms of this theorem are 
called the “Matroid Parity Theorem” or “Matroid Matching Theorem” or “Poly- 
matroid Matching Theorem”. In spite of these names, this result does not belong to 
matroid theory; it uses more from the structure of linear spaces than what matroids 
can capture. 

In a framework, we can consider the 2-dimensional subspaces determined by 
the pairs of vectors assigned to the endpoints of an edge. The matching problem 
can be formulated in terms of these subspaces, and in fact this provides useful 
geometric insight. 

Recall an elementary exercise from geometry: if a family of lines in projective d- 
space has the property that every pair intersects, then either all lines lie in the same 
plane, or they all go through one and the same point. In the language introduced 
in Appendix [A.4] a set of lines contains no two independent lines if and only if 
either they are coplanar or concurrent. 

What happens if we exclude three independent lines? Figure [18.5] shows four 
configurations with this property. Theorem [18.15] below will imply that these are 
all. 

We can rephrase this as a question about 2-dimensional linear subspaces of real 
linear space instead of lines in a projective space. Our starting observation is then 
that if a family of 2-dimensional linear subspaces in a real linear space contains 
no two linearly independent subspaces, then either they are all contained in a 3- 
dimensional subspace, or they all contain a common 1-dimensional subspace. The 
general question is: what is the maximum number of linearly independent subspaces 
in a family of 2-dimensional subspaces? 

To show how this question relates to the matching number of frameworks, select 
a finite set V of vectors so that each of the given subspaces contains at least two 
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FIGURE 18.5. Four configurations of lines in a projective space 
(of arbitrary dimension) not containing three independent lines: 
(a) lines in a 4-dimensional subspace; (b) lines in a 3-dimensional 
subspace, together with lines going through a particular point of 
this subspace; (c) lines in two 2-dimensional subspaces; (d) lines 
intersecting a given line. 


of them, and for each of the given subspaces, join two of the points of V in it by 
an edge. The condition on the subspaces says that the matching number of the 
resulting framework is less than k. 

Conversely, consider a framework. If the vectors representing the endpoints 
of an edge are not linearly independent, then this edge will not be a member of 
any matching, and we can forget about it. So we may assume that the vectors 
representing the endpoints of any edge generate 2-dimensional subspace. A set of 
edges forms a matching if and only if the corresponding 2-dimensional subspaces 
are linearly independent. 

There is a trivial bound on the matching number of a framework in R?: 


(18.16) v(G,v) < ISI. 


Combining the ideas behind the bounds (18.2) and (18.16), we can formulate an 
upper bound that, for an optimal choice, gives equality |[Lovasz 1980}. 
Theorem 18.15. Let (G,u) be a framework in R4, and let P = {Ey,..., Ex} range 


over all partitions of its edge set and let L range over linear subspaces of R¢. Let 
L; = lin(u(V(£;))). Then 


k 


(Gv) = min{ dim(L) +> 5 dim(Li/L)| }. 


i=1 
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The bound corresponds to the choice k = 1, L = 0, L; = R®. To get the 
bound (18.2), let L be the linear span of a node-cover T,, and let V\ T = {i1,..., iz}, 
and let £; consist of all edges contained in the linear span of L and uj,. 

The proof of the < direction in T heorem[18-15]is easy: for any subspace LZ and 
any partition P, we have 


v(G,v) < dim(L)+v(G,v/L) 
and for any matching M in (G,v/L), we have 


7 


and hence 
k k 
dim(L;/L) 
M| = S°|MnE\| < [a |. 
|M| 2! n Is), 5 


The proof of the > direction is quite lengthy, and has been reproduced in another 


book |Lovasz—Plummer 1986], so we do not present it here. 


Example 18.16 (Graph matching). Let us apply the above result to “ordinary” 
matchings. If the nodes of the graph G are labeled by the standard basis (e; : i € V) 
in RY, then v(G,e) = v(G). So the graph G has a perfect matching if and only if 
v(G,e) = n/2. By Theorem|18.15} this holds if and only if 


k 
(18.17) dim(L) +>) 5 dim v(E;)| > 5 


for every partition P = {F,,...,E,} of the edge set and every linear subspace L, 
where v; = e;/L. If L is spanned by some of the vectors e;, then we can directly 
see that this condition is equivalent to Tutte’s. In the general case, we need a little 
induction argument. 

If any v; = 0, then we can delete this node and proceed by induction. If the 
graph is disconnected, we can use induction on the connected components. 

So suppose that G is connected and the vectors v; are nonzero. If k > 1, then, 
since the graph is connected, there are two partition classes, say Fy, and E2, with 
V(E,)NV(E2) 4 0. Then 


dim v(V(E, UE2)) < dimv(V(£;)) +dim v(V(£2)) —1, 
which implies that 
[5 dim v(V(B,UB2))| < [5 dimv(V(E,))| +] 5 dimv(V(E,))], 


so we can merge FE, and EF to get a partition for which the inequality (18.17) is 
tighter. Going on similarly, we get that it suffices to check (18.17) for the partition 
of the edge set into a single class. Then we need 


dim(L) + 5 dimv(E)| > 5 


Since dim v(/) = n—dim(Z), this holds unless dim(Z) = 0 and n is odd, so Tutte’s 
condition is violated. 4 
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18.3.4. Matching and multilinear algebra. Let (G,u) be a framework, 
where u: V — R¢. To every edge e = ij, we assign the antisymmetric tensor 
A. = u;/uy,. (If you prefer a more elementary language, we can instead consider 
the skew symmetric dx d matrix A, = ujuj —u;u;.) The sign of A, depends on 
the orientation of the edge e, and we fix an orientation for each edge (this will not 
play an important role). 

By basic exterior algebra, it is clear that the edges €1,...,€m form a matching 
if and only if Ae, A---AAe,, # 0. It would be nice to have a proof of Theorem 
[18.15]based on exterior algebra, but this seems to be elusive. However, the following 
theorem (see [Lovasz—Plummer 1986]) generalizes a characterization of graphs with 
perfect matchings [Tutte 1947], and it does provide a useful tool for randomized 
matching algorithms. 


Theorem 18.17. For every framework (G,u), 


v(G,u) = Zee rk( teAe). 


2 reER 
ecE 

Proof. First, we prove the < direction. Let k = v(G,u), and let M bea 
maximum matching. The assertion is independent of the basis in R¢, so we may 
assume that the endpoints of the edges in M are labeled by e1,...,e€2,. Choosing 
x = lym), the matrix >. teAe will consist of 2k 1’s in different rows and columns 
and zeros everywhere else, so its rank will be 2k. 

Second, let 2r be the rank of the matrix A = >, 2.A- for an arbitrary choice 


of the numbers x; (the rank of a skew symmetric matrix is always even). Then, by 
elementary multilinear algebra, 


AN-+-AA£0 
Ve ~++_—S 


Expanding this exterior product, there must be a nonzero term, i.e., there must be 
edges €1,...,e, such that Ae, A--:AAc, #0. Let ex = ixjx, then 


Ae, A:++AAe,, = Ui, AU, A+? A Uj, AU;,, 


and if this is nonzero, then uj,,U;,,...,U;,,U;, are linearly independent, and so 
{e1,...,e,} is a matching. This proves that v(G,u) > r. 


Theorem [18.17] is of the form max = max, and so it does not provide a good 
characterization of the matching number. But the maximum on the right side is 
clearly attained for algebraically independent values of the z;, and so the maximum 
is attained for almost all random choices, say from the uniform distribution in 
[0,1]. By the Schwartz—Zippel Lemma [3.12| the maximum is attained with high 
probability if the coefficients x; are chosen randomly from (say) {0,1,...,n?}. So 
the theorem provides an efficient randomized algorithm for computing v(G, u). 


Example 18.18 (Gyrators). A gyrator g is an ideal device in an electrical net- 
work: it has a real parameter Ry (its resistance), and it forces the currents and 
voltage differences of two given directed edges ij and kl as 


(18.18) Uij = Relki Uni = —Rgli;. 


Note that R, depends on the order as well as on the orientation of the two edges: 
if we reverse 17, then Ry changes sign, and so it does when the edges 2j and kl are 
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interchanged. We denote this gyrator by g(ij, kl) if the participating edges need 
emphasis. 

Perhaps it helps understand this if we compare it with an (ideal) transformer, 
which also establishes a relationship between two edges ij and kl: 


Ui; = aug Tri = al;;. 


The behavior of a gyrator is more complicated, mostly because of the negative sign 
in the second equation. 

Consider an electrical network composed of (ohmic) resistances, voltage and 
current sources, and gyrators. Determining the currents and voltages on the edges 
means the solution of a system of linear equations in the variables U;; and [;;. This 
system may be overdetermined or underdetermined, depending on the graph and 
on the numerical values of the resistances. 

But let us assume that the parameters (resistances) of these devices are alge- 
braically independent: the network is “generic”. (This is not such an unreasonable 
assumption: in the case of real physical devices, their resistances are only approxi- 
mately specified by the production process, and so no two will be exactly the same, 
and more generally, no exact algebraic relation will hold between them.) In this 
case, the combinatorial information (the graph and the pairs of edges that are re- 
lated by gyrators) determines whether the network is over- or underdetermined. 
The following theorem was proved by [Milié 1974): 


Theorem 18.19. Let G be a network consisting of voltage and current sources, 
resistances, and gyrators. Suppose that the resistances and the parameters of the 
gyrators are algebraically independent over the rationals. Then there is a unique 
assignment of currents and voltage differences to the edges if and only if G has a 
spanning tree which contains all voltage sources, no current source, and either both 
or none of the edges of each gyrator. 


Proof. We work with the electric potential function p: V — R. For the sake 
of simplicity, suppose that the network contains only gyrators. Then setting all 
currents and voltages to zero, we get a solution; we want to know whether this 
is unique. The voltage of an edge 77 can be expressed as U;; = p;—p;, and the 
currents are determined by 


1 1 
18.19 I; =—-—(pi—pe), le = —(p;—p; 
(18.19) aa? (r—Pr), Lea R, (pj — Di) 
for the two edges 77 and kl forming a gyrator g. The only condition that is left is 
Kirchhoff’s Current Law: 


(18.20) Sep 0: 

JEN (i) 
So we have n equations in n unknowns p;. These equations are not independent, 
since their sum is trivially zero; but this is fine, since we need only the potential 


differences p;—p;. For the equations to determine the differences p; —p; uniquely, 
the rank of the system of these equations must be n—1. 


Substituting the expressions (18.19) in (18.20), we get after some transforma- 


tions 


g=9(ij,kl) 
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where the summation extends over all gyrators. By Theorem the rank of 
the matrix of this system is twice the matching number of the framework (H,v) 
on node set £, in which two edges are adjacent if and only if they form a gyrator, 
and an edge e = ij (as a node of #) is labeled with the vector v;; = e;—e;. It is 
easy to check that (n—1)/2 edges of H form a matching in (H, v) if and only if the 
n—1 edges of G corresponding to them form a spanning tree in G. 


So the gyrator problem can be reduced to the matching problem of frameworks. 
Theorem |18.15} gives a good characterization (NPNco-NP). 4 


Exercise 18.1. Let G be a cover-critical graph with a node wu of degree two, 
different from the triangle. Prove that the neighbors of u ar nonadjacent, and 
have no neighbor in common other than wu. 

Exercise 18.2. Prove that the nodes of a cover-critical graph can be covered by 
node-disjoint edges and odd cycles of the graph. 

Exercise 18.3. Let (G,v) be a cover-critical framework. Prove that for every 
node there is an optimal node-cover that contains it and also one that avoids it. 
Exercise 18.4. A fractional cover of a graph Gis a map w: V > R+ such that 
w(t)+w(j) > 1 for every edge ij. Let 7*(G) = min >, w(z), where the minimum 
ranges over all fractional covers of G. Prove that 7*(G) = 72(G)/2. 

Exercise 18.5. Let V be a finite set and let f : 2” — R be a setfunction. 
Extend f to a function f : RY — R by f(w) = ¥7)_, aif (5), where the a; and 


i=1 
S; are defined as in (18.3). Prove that f is convex if and only of f is submodular. 
Exercise 18.6. Prove that for any framework, tT2:(G,v) = k72(G,v) and 


Tor+i(G, v) => kr2(G, v) + T1 (G, v). 

Exercise 18.7. For every representation v of G, the setfunction rk(2x) is sub- 
modular. 

Exercise 18.8. Let i be a node of a framework (G,v). Let us delete all edges 
incident with 7, create a new node j, connect it to 7, and represent it by a vector v; 
that is in general position in the space lin(v(N(i)). Let G’ be the graph obtained. 
Prove that 7(G,v) = 7(G’,v), and if (G,v) is cover-critical, then so is (G’,v). 
Exercise 18.9. For a framework (G,v), split every node i into deg(i) nodes 
of degree 1, and represent each of these by the vector vi. Let (G’,v’) be the 
framework obtained. Prove that 7(G, v) = 7(G’, v), and (G, v) is cover-critical if 
and only if (G’,v’) is cover-critical. 

Exercise 18.10. Prove that if a family of lines in projective d-space contains no 
three independent lines, then it is one of the configurations in Figure [18.5] 
Exercise 18.11. Let B be a skew symmetric matrix of dimension p = 2n. For 
each partition P = {{t1,j1},...,{in, jn}} of the set {1,...,2n} into pairs, let 
mp denote the permutation 2k—1 +> ig, 2k > jx of {1,...,2n}, and let bp = 
sgn(mp)bi,;, +++ bi, 7, (where sgn denotes the sign of the permutation). Define the 
Pfaffian of B by Pfaff B = 7, bp. 

(a) Prove that bp depends neither on the order in which the classes of the partition 
are listed nor on the order of the two elements of a class. (So Pfaff B is well 
defined.) 

(b) Prove that det B = (Pfaff B)?. 


(c) Let B be obtained from the adjacency matrix of G by replacing each pair of 
1’s corresponding to the same edge by x. and —2z-, where the numbers z- are 
algebraically independent transcendentals. Prove that G has a perfect matching 
if and only if det(B) # 0. 


CHAPTER 19 


Combinatorics of Subspaces 


In this chapter we have a look at several results from previous chapters, from 
a more general point of view. In fact, we do so in two stages. 

First, we discuss some results about combinatorial properties of subspaces, 
which generalize results concerning stresses and rigidity, covering and matching in 
frameworks, and more. Collections of subspaces (points, lines, planes) in Euclidean, 
affine, or projective spaces have many interesting and important combinatorial 
properties, which are treated in detail in books about combinatorial geometry, 
and could themselves fill one or more monographs. There are deep studies of 
the numbers of cells of different dimensions they form, and of the topological and 
algebraic geometric properties of their union or its complement, just to name a 
few. Some of their properties are generalizing graph-theoretic results treated in 
this book, and we restrict our attention to results of this nature. 

The second level of more abstract and more general results uses matroid theory. 
This theory has been the topic of several monographs, and it is difficult to fit it 
in a book whose aim is to establish more direct connections between graph theory 
and geometry. On the other hand, matroid theory provides deeper insight into 
several of the results presented in this book. In the last section of this chapter, a 
minimalist introduction to matroids is followed by a survey of how some of the key 
constructions and key results can be put in the framework of matroid theory. 


19.1. Covering of hypergraphs and subspaces 


Several questions about covering the edges of graphs extend to covering the 
edges of hypergraphs, and even to covering the members of a family of subspaces. 


Bollobds 1965] proved the following generalization of Theorem b): 


Theorem 19.1. Let A;,..., Ay be p-element sets and B,,..., By, g-element sets. 
Suppose that A,B; =0 for alli, but ANB; £0 fori Aj. Thenn< ue 


To obtain Theorem |18.2| we take p = 2 and gq = 7(G)—1, let the sets A; be 
the edges of G, and let B; be a minimal node-cover of G\{A;}. Recall that we 
proved and used a generalization (Theorem [18.8{b)), where B; was a subspace of 
dimension 7(G,u)—1 covering all edges of the framework (G,u)\{A;}. An anal- 


ogous generalization of Theorem can also be proved (see Exercise {19.5). One 
can even replace both families of subsets by families of subspaces |Lovasz 1979al: 
Theorem 19.2. Let Aj,...,An be p-dimensional subspaces and B,,...,Bn, q- 


dimensional subspaces of R¢. Suppose that A;NB; = 0 for all i, but A;NB; #0 
fori#j. Thenn< er 
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Proof. We may assume that d = p+q. Indeed, trivially d > p+q. If d > p+q, 
then the orthogonal projections A’ and Bi of A; and B; onto a generic (p+q)- 
dimensional space satisfy the same conditions as the original subspaces A; and 
B;, so it suffices to prove the lemma for these projected subspaces of a (p+q)- 
dimensional space. 

For a subspace A C R®%, choose a basis a;,...,a, and define the antisymmetric 
r-form V4 = a,/---Aa,. This form depends on the choice of the basis, but only up 
to a nonzero scalar factor. The important property for us will be that V4A\ Vp =0 
if and only if the subspaces A and B have a nonzero intersection. In particular, 
W4,\Vp, #0, but V4, AUB, =0 foriF¥ j. 

We claim that the tensors Vy,,..., U4, are linearly independent. Suppose (by 
way of contradiction) that there is a linear dependence 5°, a; 4, = 0, and let 7 be 
any subscript with a; 4 0. Then 


0= (Soaita,) Ave, = So ai(a,AVp,) = 05(Va, Vp,) £0, 


a contradiction. 
It follows that n is not larger than the dimension of the space of antisymmetric 
p-tensors in a (p+q)-dimensional space, which is ae 


Remark 19.3. The proof shows that the condition that A;1B; 4 0 for i 4 j 
could be weakened: it would suffice to assume that A;.B,; 4 0 for i < j. Theorem 
[19.1]}can be proved by a similar linear algebra argument, replacing exterior product 
of vectors by their symmetric product, or (essentially equivalently) by the product 
of variables. 


These theorems can be applied to bound the size of critical families of sets. 
A hypergraph is a finite family of finite sets. The node-cover number T(H) of a 
hypergraph H is the minimum cardinality of a set meeting every member of H. 
The hypergraph is called cover-critical, if T(H’) < T(H) for every H’ CH. 

For a family F of subspaces, we define its linear covering number Tin(F) as 
the minimum dimension of a linear subspace intersecting every subspace in F in 
a nonzero subspace. The family F is called cover-critical, if Tin(F’) < nin(F) for 
every F’ C F. The first assertion of the next theorem was proved in [Bollobas 1965); 
the second, in [Lovasz 1977]. 


Theorem 19.4. 
(a) If H is a cover-critical family of k-sets, then |H| < Ce), 


(b) If F is a cover-critical family of k-dimensional subspaces, then |F| < 
(ee) : 


Proof. We prove the second assertion; the proof of the first one is similar. Let 
F = {Aj,...,An}. For each 1 <i < n, the family F\{A;} can be covered by a 
subspace B; of dimension !—1. This means that A;NB; # 0 for 7 A i. On the 
other hand, B; cannot cover F, and hence A;7 B; = 0. By Lemma|19.2} it follows 
that na). 


19.2. Transcendence degree and transcendence rank 


Notions like “general position” and “generic” have played an important role in 
several chapters of this book. In this section we put these notions in a more general 
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context, leading to a couple of combinatorial results in the next section that, as we 
shall see, generalize several previously presented results. 

We fix a real closed field K Cc R (that is, the square root of every positive 
number in K is in K, and every polynomial of odd degree with coefficients in K 
has a root in K). Most of the time, you could think of this as the field of real 
algebraic numbers without losing any of the essential ideas (many of the arguments 
would even work over the rational field), but for a few arguments we need this 
generality. It is well known that every real number that is algebraic over K belongs 
to K. We need, in addition, that K is not too large: we assume that R has infinite 
transcendence degree over K (so that we can find arbitrarily many real numbers 
that are transcendental over K). For a set A of real numbers, we denote by K(A) 
the real closed field generated by K and the elements of A. For a finite set A of 
vectors and/or matrices, we define K(A) = K(A), where A is the set of all entries 
of all elements in A. 

Let L be a linear subspace of R¢. It is clear that if L 4 0, then K(L) = R is 
uninteresting. To measure how “transcendent” L is, we consider “simple” vectors 
generating L. Let k = dim(L). There is at least one k-tuple of coordinates such 
that the projection of L onto these coordinates is k-dimensional; let us call such a 
k-tuple of coordinates basic. Unless said otherwise, we fix a basic k-tuple, and (for 
notational convenience) reorder the coordinates so that this basic k-tuple consists 
of the first k coordinates. Let R¢* denote the linear space of all vectors in R¢@ 
supported on the first k coordinates. The spaces R@* and R* are isomorphic in a 
trivial way: the map HT, : R? — R* deleting the last d—k coordinates provides 
this isomorphism; but it will be useful to distinguish them. 

There is a unique basis in LZ whose projection onto the first & coordinates is the 
standard basis in R*; we call this the standard basis in L (with respect to the basic 
set {1,...,k}). The matrix whose columns are the vectors in the standard basis 
can be written as (ay, where X € R(4-*)**, Any other basis of L can be obtained 
as the columns of a d x k matrix ( ee) with some nonsingular k x k matrix B. We'll 
abuse terminology and say that a dx k matrix A is a basis of L if its columns form 
a basis of L. 

We say that the subspace is algebraic over K, if it has a basis consisting of 
vectors with all entries in K. We are going to omit “over K” if no confusion can 


arise. 


Lemma 19.5. For every linear subspace L C R¢, there is a unique smallest real 
closed field over which it is algebraic. This field is generated (as a real closed field) 
by the entries of any standard basis. 


Proof. Let @) be a standard basis of ZL. Consider any other basis A = Cy 
Here B is a submatrix of A, and so the entries of B~' belong to the field K(A). 
Hence 


K(A) > K(B7!A) = K(%) = K(X). 


This shows that K(X) is the smallest field generated by any basis of L. 


We denote the field in the lemma by K,. The proof above shows that L is 
algebraic over a field K if and only if the standard basis (with respect to one, or 
all, basic k-tuples of coordinates) has all of its entries in K. Another consequence 
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of Lemma |19.5}is that 
(19.1) K,1 = Kz, 


- 
Ta-k 


since the space L+ has a standard basis ( 
Kr. 

For a field K and a set of real numbers X, we define the transcendence degree 
deg,,(X|IK) of X over K as the maximum number of elements of X that are al- 
gebraically independent over K. In the case of the field denoted by K, we often 
use the simplified notation deg,,(X), where “over K” is understood. For a linear 
subspace L C R®, we define its transcendence rank as the transcendence degree of 
Ky over K: 


(19.2) O(L) = 0(L|K) = deg,,(K,) = min{deg,,(A|K) : A is a basis of L}. 


We need some simple properties of the transcendence rank. From the proof of 
Lemma we see that for any k-dimensional subspace L C R?, we have 


), and so Kz. = K(—X") = K(X) = 


(19.3) O(L) < k(d—k). 
Equation implies that 

(19.4) O(L+) = O(L). 

It is easy to see that 

(19.5) O(L,+L2) < 0(£1)+0(L2). 


We have noted that “bad” bases of a subspace LZ can have arbitrary transcen- 
dental entries. But no basis can be “too transcendental” . 


Lemma 19.6. For every k-dimensional subspace L C R¢, we have 
max{deg,,(A) : A is a basis of L} = 0(L) +k’. 


Proof. Let @) be a standard basis of L, then deg,,(X) = @(L). Let (ae) 
be any other basis, where B is a nonsingular k xk real matrix. Since the entries 
of ( ) can be computed from the entries of (2) algebraically, and vice versa, we 
have 


dete (i) = deter) = dete X) + dee (BIK(X)) = 6(E) + dee, (BIK(X)). 


The last term is maximized when all entries of B are algebraically independent over 
K(X). 


19.2.1. Transcendental subspaces. A subspace is “generic”, if it does not 
satisfy any special condition it does not absolutely have to... This is not a defini- 
tion. One way of making it precise is the following. 

We say that a set of vectors in R@ is fully transcendental (over K) if all their 
entries are algebraically independent over K. It is easy to see that a fully transcen- 
dental set of & vectors is linearly independent as long as k < d. 

A k-dimensional subspace L C R?@ is transcendental (over K), if it has a fully 
transcendental basis. By Lemma|19.6] this is equivalent to saying that 


(19.6) 6(L) = k(d—k). 


By (19.4), this implies that a subspace L C R¢ is transcendental if and only if L+ 
is. 
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Applying a nonsingular linear transformation with entries in K, every algebraic 
subspace is mapped onto an algebraic subspace, and every transcendental subspace 
is mapped onto a transcendental subspace. If K C K’ C R are fields, then any 
subspace algebraic over K is algebraic over K’, and any subspace transcendental 
over K’ is transcendental over K. If J is a transcendental subspace of R?, then 
any k = dim(.J/) coordinates are basic, since the projection of a fully transcendental 
basis of J to R* is a fully transcendental basis of R*. 

Let L C R¢@ by any subspace of dimension k. We say that a subspace of L 
is transcendental within L, if it is the intersection of L with a subspace that is 
transcendental over K;. Note that 0 and LZ are both transcendental subspaces of 
D. 

In the special case when L = R%, to be transcendental within L is equivalent to 
being a transcendental subspace of R¢. More generally, we can state the following 
fact, which is absolutely natural, but it takes some work to prove it (Exercise[19.2). 


Lemma 19.7. A subspace J C R®* is transcendental within R@* if and only if its 
projection U,.J is a transcendental subspace of R*. 


If ZL is any algebraic subspace of dimension k, then we can apply a linear 
transformation with entries in K that maps L onto R®*, and this preserves tran- 
scendentality of subspaces of L. Sometimes we will need this linear transformation 
to be orthogonal. For this, we need to compute an orthonormal basis B in L, to be 
mapped onto the standard basis of R“"; in general, this basis cannot be expressed 
rationally by the entries of a standard basis of L, but the entries of B are in K 
nevertheless (here we use that K is real closed). We will refer to this procedure as 
“normalization” . 

Using normalization, we can extend several simple previous observations about 
transcendental subspaces to subspaces that are transcendental within a subspace. 
For example, we have 


(19.7) Ox, (H) < dim(H)(dim(./) —dim(H)) 


for any two subspaces H C J, with equality if and only if H is transcendental 
within J. As another consequence, we have seen that the orthogonal complement 
of a transcendental subspace of R®@ is transcendental. This can be extended to 
transcendental subspaces of any subspace: If H is transcendental within J, then so 
is JOH+. 


Lemma 19.8. Let H C J be subspaces of R? with dim(H) = q and dim(.J) 
Then 


Te 


O(J)—(r—q)(d—-r) < O(H) < OJ) +(r—-a)a 
If equality holds in the first inequality, then JOH+ is transcendental within H+. 
If equality holds in the second, then H is transcendental within J. 


Proof. Since H and JN H+ generate J, we have 
0(J) < 0(H)+0x,,(JNH*). 
Here by and (19.7), 
(19.8) Ox,,(JNH+) < dim(JNH*)(dim(H*) —dim(JNH+)) = (k-r)(d—k). 


This proves the first inequality. If equality holds, then we have equality in (19.8), 
which means that JO H+ is transcendental within H+. 
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The second inequality follows from the first by replacing H and J by J+ and 
H+, using the relations 0(J+) = 6(J) and 6(H+) = 0(H). In the case of equality we 
know that H+ J is transcendental within J. Then so is its orthogonal complement 
within J, which is just H. 


19.2.2. The intersection of algebraic and transcendental subspaces. 
The following lemma summarizes a number of useful relations between an algebraic 
subspace and a transcendental subspace. 


Lemma 19.9. Let L be a algebraic subspace and J, a transcendental subspace of 
R?. Let k =dim(L) andr = dim(J). Then 

a) either L+ J =R¢ or LOJ =0; 

b) ff L+J #R¢, then 0(L+J) =r(d—k—-r); 

c) if LAT £0, then 6(LNJ) = (k+r—d)(d—r); 

d) J is transcendental within L+ J, and LO J is transcendental within L. 


Proof. We may assume by normalization that L = R@*. 

a) First, assume that k+r =d. Let B be a fully transcendental basis of J. 
The submatrix of B formed by the last r rows is nonsingular, and hence no vector 
in J is supported on the first k = d—r rows. Thus LN J = R?*qJ =0. This also 
implies that L+J = R?. 

Next, if k+r < d, then we extend L arbitrarily to a (d—r)-dimensional alge- 
braic subspace L’, and by the above, L'NJ = 0 and so LN J = 0. It follows by a 
similar argument that if k+r > d, then L+J =R?. 

(b) We have dim(ZL+ J) = k+r < d by (a). We may assume that the last r 
coordinates are basic in J, so J has a basis of the form leas where B € R(@-™)*?, 
Let B’ be the (d—r—k) xr matrix obtained from B by deleting its first k rows. 
Then the columns of the matrix 


IT, 0 
A={0 B 
0 I, 


On the other hand, J is transcendental in R%, so 6(J) = r(d—r), Lemma 
implies that 0(L+J) > 0(J)—kr =r(d—k—r). 

(c) This follows by applying (b) to the algebraic subspace L+ and transcenden- 
tal subspace J+. 


(d) Both assertions follow from Lemma since the pairs J C L+J and 
LAJ CL satisfy the inequality in the lemma with equality. 


form a basis of L+ J, showing that 0(L+ J) < deg,,(A) = deg,,(B’) < (d—k-r)r. 
fr 


Here is a heuristic idea; the rest of this section will be devoted to make it 
precise. Let us choose a “generic” subspace J and a subspace H C J. We expect 
that JO H+ is a “generic” subspace in H+. Otherwise, making a “generic” choice 
U among the subspaces of H+ of the same dimension as JN. H+, the subspace 
H+U would be even “more generic” than J. 

This conclusion is, however, false in general: 


Example 19.10. Let a,b and c be algebraically independent over K, and consider 
the 2-dimensional subspace J C R® defined by the equation ax+by+cz = 0. Then 
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J+ is generated by the fully transcendental vector (a,b,c), and hence J is a tran- 
scendental subspace of R?. Let H be the 1-dimensional subspace generated by the 
vector (1/a,1/b,-2/c)'. Clearly H C J, and Ky = K(a/c,b/c) = Kj. A vector 
spanning the subspace JM. H+ can be computed in Ky = Ky, and so it is not a 
transcendental subspace of H+. ¢ 


Lemma gives a sufficient condition on J and A for the above conclusion 
to hold. A simple special case when this applies is if H = JN L for some algebraic 
subspace L. More generally, 


Lemma 19.11. Let J be a transcendental subspace of R%, and let H = (Li 
J)®@++-@(LmOJ), where Ly,..., Lm are algebraic. Then JNH+ is transcendental 
within H+. 


Note the “&” operation sign, which implies that the subspaces [1 J,..., LmM 
J are linearly independent. 


Proof. We may assume that the intersections DL; J are nonzero subspaces. 
Let r = dim(J) and k; = dim(Z;), then dim(L;N J) = ki +r—d by Lemma a), 
and so dim(H) = S7y",(ki +r—d). By Lemma d), the subspace [iN J is 
transcendental within L;, and hence 0(£;0 J) = (kj +r—d)(d—r). Thus 

O(H) < S°O(LiNJ) = S_(ki+r—d)(d—r) = dim(H)(d—r) 
i=1 i=1 


= 6(J) —(r—dim(H))(d—r). 


By Lemma|19.8| this implies that JQ H+ is transcendental within H+. 


19.3. Generic points 


19.3.1. The rank of generic points. After these rather technical lemmas, 
we come to the first main result in this topic. We say that a point a is a generic 
point of a rational subspace L, if lin(a) is a transcendental subspace of L over K. 
The definition guarantees that a generic point of a rational subspace L does not 
have any property expressible by algebraic relations (with coefficients in K) between 
the entries of the point, unless all points of L have this property. For example, if a 
is a generic point of a rational subspace L, and a € J for another rational subspace 
J,then LC J. 

Informally, if we introduce in some arguments below a “generic point” a, we 
mean that it is fully transcendental over the field generated by all coordinates of 
all vectors introduced in the argument previously (including appropriate bases of 
all the subspaces we considered). 


Lemma 19.12 (Generic Point Lemma). Let F be a family of subspaces of R®. 
For every A € F, select a generic point x4 € A. Then 


tk{x4: AE FL= iin (\F\G|+rk(G)). 


By the remark above, we should select the points x4 one by one so that they are 
generic taking into account the previously selected points. This property does not 
depend on the order in which these points are selected. 
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Proof. For every G C F, we have rk{x4: A €G} < |G] and rk{x4: Ae 
G} <rk(G), whence 


(19.9) trk{x4: ACF} <rk{x,: AC F\Gh4+rk{x,: AEG} 
< |F\G|+rk(g). 


(This inequality holds, in fact, for every choice of representatives x4 € A, not only 
for a generic choice.) 

To prove that equality holds for a generic choice of the representatives and for 
a suitable G, let us consider any subspace A € Ff, and its representative x4. If 
x4 ¢lin{xg: Be F\{A}}, then we can delete A from F and apply induction: 


rk{xp: Be F}=14rk{xg: Be F\{A}} =14|F\{A}\G| +rkG 


for a suitable G C F\{A}. This shows that this set G gives equality in (19.9) as 
well. 

So we may assume that x4 € lin{xg: B¢ F\{A}}. Using the assumption 
that x,4 is a generic point of A, it follows that A C lin{xg: Be F\{A}}, and 
so, even more, A C lin{xg : B«€ F}. This holds for every subspace A € F, and 
hence UF C linfxg: B«€ F}. Clearly equality must hold, whence 


rk{x4: Ae F}=rk(F), 
showing that G = F gives equality in (19.9). 


The following version of the Generic Point Lemma does not involve generic 
points. 


Corollary 19.13. Let Aj,...,An be finite subsets of a linear space R¢. Then 


ps rk{aj,..., an} = a (»(U Ai) +n— 5) : 


We could replace the expression on the left by 


max{|X|: X C R®, X linearly independent, |X .A,| < 1}. 


Proof. The max < min direction is clear by the same simple argument as 
above. To prove the reverse inequality, let D; = lin(A;), and let x; be a generic 
point in L;. Then 


rk{x1,...,Xn} = a (H(U A.) +n— s) 


iEeS 


by the Generic Point Lemma|19.12} We want to replace the points x; by appropriate 
points a; € A; so that the dimension of the set does not decrease. This can be 


done one-by-one: if x, is linearly dependent on x2,...,Xn, then we can replace 
it by any element of A;. If it is linearly independent from x2,...,xXn, then L, Z 
lin{x2,...,Xn}, and hence A; has an element a; ¢ lin{xo,...,x,}, and we can 


replace x; by a;. Repeating this procedure for every x;, the corollary follows. 


We can also formulate a probabilistic version of the theorem. 
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Corollary 19.14. Let F be a family of nonnull subspaces of R¢. For every A € F, 
select a random point x4 € A uniformly from the unit sphere in A. Then almost 
surely, 

tk{x4: AC F}= min (\F\G|+rkG). 


We could replace the uniform distribution on the sphere by any probability 
distribution on A that is not concentrated on a finite number of hyperplanes in A. 


19.3.2. Applications of generic points. As an application of the Generic 
Point Lemma, let us prove the Matroid Sum Theorem, at least in the linear case. 


Theorem 19.15. Let us consider k vector labelings vi : V + RY (j =1,...,k) 
of the same set V = {1,...,n}. Let Y¥; CV (i=1,...,k) range over sets such that 
v'(Y;) is a linearly independent set of vectors. Then 


k 
Yi,U---UY;| = mi xX rk(v'(X)). 
max |Y¥4U---UYe| = gain |V\ oe (v'(X)) 
1= 
Clearly we could restrict the sets Y; to be disjoint, or, going in the other direc- 
tion, we could require that Y; is a basis for v’. 


Proof. The max < min direction is easy to check. To prove the converse, 
consider the set of vectors 


Vj 0 0 
0 v5 0 

(19.10) Aj = efx} =e] (FS Tiny) 
0 0 vi 


in R“ @---@R*. 

Let Y,,...,Y~% C V be disjoint sets such that v'(Y;) is a linearly independent 
set. For each j € Y;, choose aj € A; so that the i-th block of a; is nonzero. For the 
indices in V \ (Yi U---UY;), choose a; € A, arbitrarily. Then the set a(Y| U---UY,) 
is linearly independent, and so 


rk(a(V)) > |¥,U---UY,]. 
It is easy to reverse this argument, and show that for every choice of vectors a; € A; 
we can find disjoint sets Y; C V such that v‘(Y;) is linearly independent and 
rk(a(V)) = |¥,U-- -UY,]. 
So 
(19.11) maxrk(a(V)) = max |Y,U---UY;|. 
Y1,--5¥r 


a 


eer 


To find the maximum on the left, we use Corollary|19.13| and get the theorem. 


As a special case, we describe the answer to the following question: Given a 
finite set of vectors S C R*, when can we partition it into k linearly independent 
subsets? A trivial necessary condition is that |S] < dk. This condition, when 
stipulated for every subset of S, is also sufficient. 

Let S = {uj,...,Un}, and invoke Theorem [19.15] with a 
u: [n] > R¢. The set of vectors S can be partitioned into k linearly independent 
subsets if and only if the maximum on the left side of the min-max formula in 


374 19. COMBINATORICS OF SUBSPACES 


Theorem }19.15} is n; this means that the right side must be at least n for every 
X CS: krk(X)+|S\X|>n. Rearranging, we get the following corollary: 


Corollary 19.16. A finite set S C R@ can be partitioned into k linearly independent 
subsets if and only if 

krk(X) > |X| 
for every X CS. 


From this Corollary, it is not hard to derive the theorems of Tutte and Nash- 
Williams on covering a graph by forests and on disjoint spanning trees (Theorem 


see Exercise [19.9). 


19.3.3. Gale duality. Before giving an application of Theorem we 
describe a simple but very useful construction. Let V be a finite set of n elements, 
and let u: V > R¢@ be a vector labeling of V, where we assume that u(V) spans 
R?. Consider u as a dx V matrix A, then rk(A) = d, and the row space L of 
A d-dimensional subspace L of RY. Let b,...,bn—a be a basis of L+, which we 
consider as the rows of an (n—d) x V matrix B. Let {v;: j © V} be the columns 
of the matrix B. The vector labeling v of V is called a Gale dual of u. 

Being Gale duals is a symmetric relation between vector labelings of the same 
set. The Gale dual is not uniquely determined: choosing a different basis in D+ 
results in a different Gale dual. But all Gale duals are images of each other under 
linear transformations of R’~¢. 

The most important property of the Gale dual is the following. 


Proposition 19.17. Let v : V — R"@ be a Gale dual of the vector labeling 
u: V>R?. Let X CV, |X| =d be a basis foru. Then V\X is a basis for v. 


Proof. Suppose not, then there is a nontrivial linear combination of the vectors 
in v(V\ X). This means that there is a (row) vector c that is supported on V\X 
and orthogonal to every row of B. This means that c € L, and so it is a nontrivial 
linear combination of the rows of A. Restricting the rows to X, we see that cl = 0 
is a nontrivial linear combination of the vectors a;|. But this is impossible, since 
the vectors a;|x form a nonsingular dx d matrix. 


More generally, we can express the rank of a subset of V in the labeling v in 
terms of the rank of its complement in the labeling u: 


(19.12) rk(v(X)) = rk(u(V \ X))+|X|-d. 
This follows by a simple computation; it is easy to check on Wenn diagrams that 
for any two sets X,S CV, 

(V\ S)NX| = |SAO(V\X)|+|X]—[5]. 
Letting S' range over all bases for u, we get (19.12). 
Example 19.18. A nontrivial example of Gale duality comes from the theory of 
orthogonal representations, more exactly, from the duality of the TSTAB body. 
Let z > 0 be a boundary point of TSTAB(G), and let a'x = 1 be a hyperplane 
supporting TSTAB(G) at z. Then by Corollary |11.27| a is a boundary point of 


TSTAB(G). By Proposition |11.26} a is the profile of an orthogonal representation 
(u,c) of G in R¢, and similarly, z is the profile of an orthogonal representation 
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(v,d) of Gin R®. Then a'z = 1 implies (by the same argument as in the derivation 
of (11.19)) that 
S (clui)(d'vi)(ujovi) = cod. 


Defining G; = (d'v,)u; and ¥; = (c'u;)v;, we get that 
» U,0V; = cod. 
i 


This means that the matrices U = (U1,...,U,,c¢) and W = (¥v1,...,U,, —d) satisfy 
UW! = 0. The columns of W form almost a Gale dual of the columns of U. I 
say “almost”, because ranks of U and W do not necessarily add up to n+1. We 
can, of course, add more rows to (say) W to make it a full Gale dual, but perhaps 
the following property is more important: for every i,7 € V, either U; and uj are 
orthogonal, or V; and vj are. 4 


19.3.4. Bipartite matching revisited. As an application of Theorem}19.15 
let us give a new proof of the Marriage Theorem for frameworks (Theorem |18.14), 


or, more precisely, in the following form (which is just the Matroid Intersection 


Theorem [Edmonds 1970], for representable matroids; see Section|19.5). 


Theorem 19.19. Let u: V > R¢ andv: V > R® be two vector labelings of the 
same finite set. Then 
max{|X|: X CV, u(X) and v(X) linearly independent} 
= min{rk(u(Y))+rk(w(V\Y)): Y CV}. 

To see the connection with the Marriage Theorem, first note that Theorem 
19.19] is a special case of Theorem 18.14} when the bipartite framework consists 
of disjoint edges. Conversely, starting with an arbitrary bipartite framework, we 
can delete isolated nodes and split each node into nodes of degree 1 with the same 


vector label. This does not change the values of 7 and v, and so it reduces the 
general case to the case when each node has degree 1. 


Proof. With the notation of the theorem, we may assume that u(V) spans R@ 
and v(V) spans R°. For a set X to be considered on the left side, u(X) must be 
linearly independent, so X extends to a basis for u, and similarly, X extends to a 
basis Z for v. So the left side can be formulated as follows: 


max{|YNZ|: u(Y) is a basis of R¢, v(Z) is a basis of R°}. 


Let w: V — R”© be a Gale dual of v, then Z is a basis of v if and only if 
Z' =V\Z isa basis of w. Since 


IYAZ| =|YUZ|-|Z"| =|YUZ'|—-(n—e), 


finding the maximum of |YM Z| is equivalent to finding the maximum of |YU2Z’|, 
where Y is a basis for u and Z’ is a basis for w. But this is answered by the special 
case k = 2 of Theorem|19.15 


min { |V\X|+rk(u(X)) +rk(w(X)) } 


= min {n— |X |-+rk(u(X)) +rk(v(V\ X)) + |X| —e}. 


max |YUZ’| 
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and so 
max |YN Z| = max|YUZ"|—(n—e) = minrk(u(X)) +rk(v(V \ X)) 


as claimed. 


19.4. Generic hyperplanes 


19.4.1. The rank of generic intersections. The following theorem was 
proved in [Lovdsz 1977] (with a somewhat sketchy proof; see also Section 
below for its matroid-theoretical predecessors). 


Lemma 19.20 (Generic Hyperplane Lemma). Let F be a family of nonzero 
subspaces in R¢, and let H be a linear hyperplane that is transcendental over the 
field generated by the fields Ky, (L € F). Let F’ ={HNA: AEF}. Then 


rk(F') = min{rk( Gi) +--+ +rk(Gx) — kh, 
where {G1,...,Gx} ranges over all partitions of the set F. 


Proof. Define A’ = HMA for subspaces A C R4, and let G/ = {A’: Ae Gh 
for G C F. Note that lin(G’) C lin(G)’ (but equality does not always hold). Hence 
rk(G’) < rk(G)—1 if G’ £0 (here we use already that H is transcendental), and so 
for any partition {G1,...,Gx} of F, 

k 


k(F’) <M) < Da (rk(G.)— 


We want to prove that equality en here for a suitable partition of F. 

Let Q = lin(F’). Clearly Q C H. For every A € F, we have dim(A’) = 
dim(A)—1 (since H is transcendental). Since A’ C Q (by the definition of Q), 
but A Z Q (as A Z H), this implies that QO. A = A’. Hence dim(QUA) = 
dim(Q) + dim(A) —dim(A’) = dim(Q) +1. 

Let Ni,...,N, be all the different subspaces of the from Q+A (A € F). It 
follows from the above that dim(N;) = dim(Q)+1 and N;N.N; = Q for i F 7. 
Let G; = {AE F: Q+A = Nj}. Clearly Gi,...,G, form a partition of F. Set 


(Figure (19.1). 


The key step in the proof is the following claim. 
Claim 1. The subspaces M! are linearly independent. 


Suppose this is not the case, and let Mj,..., 4! be a minimal family of these 
subspaces that are not linearly independent, so there are nonzero vectors x; € Mj 
such that x; +:--+x, = 0. Let X = Mj+---+M/_., then x,_1/X 4 0 (else, 
M;...,M/_, would be dependent), and x,_1/X+x,/X = 0. Hence M,_1/XNM 
M,./X £0. 

On the other hand, (M,-1+X)N(M,+X) C N,-1NN, = Q, and hence 
M,-1/XNM,/X = Q/X C H/X. Notice that M{,...,M/_. are linearly inde- 
pendent by the minimality of r, and so X = M{@---@Mj/_,, where every M} is 
the intersection of a rational subspace G’, with H. So Lemma|19.11|can be applied, 
and we obtain that H/X = HNX+ is transcendental within X+. It is easy to 
see that the subspaces M,_1/X and M,./X are algebraic within X+, and so H/X 
cannot contain M,_1;/X1M,/X 4 0, a contradiction. 

We can strengthen this Claim a bit: 
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FIGURE 19.1. Intersecting a family of subspaces {M1, Mz, M3} by 
a generic hyperplane H (the picture uses the representation in 
projective space). Here k = 3, and each G; consists of a single 
2-dimensional subspace. If the (1-dimensional) subspaces Mj, M3 
and M3 are linearly dependent, then projecting from Mj, the image 
of H will go through the common image of M$ and M3, so H cannot 
be generic. 


Claim 2. The subspaces M,,M},..., Mj, are linearly independent. 


Indeed, if x1 € M,, x; € M} for i = 2,...,k, and x,4+---+x, = 0, then 
x, € H (since all the other x; belong to H), and so x; € M} for alli. By the Claim, 
this implies that every x; = 0. 


Now it is easy to complete the proof. Claim 1 implies that L,,...,L, are 
linearly independent: 
(19.13) dim(LZ,)+---+dim(L;) = dim(Z,+---+L,) = dim(Q). 


Claim 2 implies that 
dim(M,)+dim(L2)+---+dim(L,) = dim(M, +Q) = dim(N;) = dim(Q) +1. 
It follows that 


(19.14) dim(Z,) = dim(M,) —1. 
Applying (19.14) to the other subspaces M; as well, and using (19.13), we get 
k k k 
rk(F’) = dim(Q) = S > dim(L;) = 50 (dim(M;)—1) = So (rk(G;) -1). 
i=1 i=1 i=1 


19.4.2. Generic hyperplanes and rigidity. As an application of the 
Generic Hyperplane Lemma, we give a description of generically rigid graphs and 
new proofs of Theorem [15.11] (the equivalence of (a) and (b)) and Theorem 

Consider a framework (Gu) in the plane. As shown in Section[15.2| the frame- 
work is stress-free if and only if the matrices Ub.b! are linearly independent, where 
U is the 2x V matrix whose columns are the vectors u;, and be = e;—e; for 
e = ij € E. These matrices have two properties: first, they are of the form xb’, 
where x. € R?; second, they satisfy the equation (RU)-X = 0, where R is the 
counterclockwise rotation by 90° in the plane. This equality is easy to verify by 
direct computation, or through the argument 


(RU)-(Ub.b!) = tr(RUb!b,U') = R-(Ubib.U') = 0, 
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since R is a skew symmetric 2x2 matrix, while Ub!b.U' is a symmetric 2x2 
matrix. 

So for every edge e we have a 2-dimensional linear subspace L, of 
consisting of the vectors xb/. Furthermore, we have a hyperplane H defined by 
(RU). X =0. The matrix Ub,b] is a (typically nonzero) vector in the intersection 
of this line with this hyperplane. 

Now assume that U is generic; then the hyperplane is generic (with respect to 
the subspaces L,) Ub.b! # 0, and the intersections HNL, are 1-dimensional, i-e., 
they consist of scalar multiples of Ub.b!. The Generic Hyperplane Lemma|19.20] 
can be applied, and gives that 


2xV 
R*", 


k 
19.15 k{Ub.b!: e€ E} = mi k{L,.: e€ E;}-1), 
(19.15) re(UbdDE: €€ B}= ,min, Yokes e€ Fi} 
where {F1,..., £,} ranges over all partitions of E. 


It is not hard to figure out the rank of {Z.: e € E;}. If the edges in FE; form 
a connected graph G;, then the vectors b, (e € F;) generate all vectors supported 
on V; = V(G;) whose entries sum to 0. The dimension of this space is |V;|—1, and 
so rk({Ze : e € E;}) = 2(|Vi|—1). If G; is disconnected, then we have to sum 
this over all components, but in this case we improve the value on the right side of 
if we split E; into its connected components. So we get that 


k 
; bb: e€ E}=_ mi 2|Vi| —3). 
(19.16) rk{Ubebz : ¢ € E} = min, d| |\Vi| 3) 
To get a rigid graph, this value must be 2n—3 (the dimension of Mat in this case), 
which is just the partition condition in Theorem|15.14| To get a stress-free graph, 
we need that this dimension is at least m, in other words, >> ,(2|V;|—3) > m for 


every partition {F),...,£,} of E. We can write this condition as 
k 

(19.17) >5(2\Vi| -3—|Bil) > 0. 
i=1 


If Laman’s condition holds, then all terms here are nonnegative, and so (19.17) is 
valid, and the graph is stress-free. Together with the trivial implication (a)=> (b), 


this proves the equivalence of conditions (a) and (b) in Theorem |15.11} (This 
15.11{c).) 


approach does not yield the equivalence of Henneberg’s condition 


19.5. Matroids 


Matroids are common generalizations of several “independence” notions (lin- 
ear independence, algebraic independence, circuit-freeness in a graph). This con- 
cept was introduced by |Whitney 1935); independently, a couple of years later 
observed that linear and algebraic independence satisfy 
the same basic properties. Here we recall some very basic facts only, which are nec- 
essary for describing the matroid theoretic aspects of our discussions in this book. 
There are several excellent monographs on the subject [Recski 1989], [Oxley 1992], 
covering much more. In particular, the reader is referred to these monographs or 
to any of the numerous survey articles for the proofs of the results quoted below. 
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19.5.1. Basics. A matroid is a pair (EM), where £ is a finite set and M C 
2” is a family of its subsets called independent sets, satisfying the following axioms: 


(M1) 0EM; 
(M2) If X € M and Y C X, then Y eM; 


(M3) If X,Y € M and |X| > |Y], then there is an element x € X\Y such that 
YU{a} eM. 

These axioms impose a lot of structure on the matroid. It follows that all max- 
imal (by inclusion) independent subsets of a set S C E have the same cardinality, 
which we call the rank of S, and denote by r(S). Maximal independent subsets of 
E are called bases; they all have the same cardinality r(£), which we call the rank 
of the matroid, and denote it simply by r. 

Maximal subsets of S with a given rank are called flats. Flats of rank r—1 
are called hyperplanes. Minimal dependent sets are called circuits. There are many 
important properties of the rank function, of flats and of circuits; we need some 
basic properties of the rank function only. 

It will be useful to consider a larger class of setfunctions. Let f : 2” > R bea 
setfunction. We say that f is monotone, if Y C X C FE implies that f(Y) < f(X); 
it is called submodular, if 


F(XNY)+f(XUY) < f(X)+F(Y) 


for every two sets X,Y C E. In this brief section we need integer-valued set- 
functions only, but there are interesting examples of submodular setfunctions that 
are not integer-valued (see Exercises [19.11f19.13). An integer valued, monotone, 
submodular setfunction f with f(@) = 0 is called a polymatroid. Restricting a 
polymatroid to the set of elements of rank at most 1 we get a matroid. 

The rank function of a matroid is integer valued, monotone and submodular, 
and in addition it satisfies the trivial conditions that r(@) = 0 and r({x}) < 1 for 
every x € FE. Conversely, these conditions characterize rank functions of matroids. 

The terminology above already suggests that matroids are motivated by two 
basic examples. 


Example 19.21 (Linear matroids). In this most basic example, the underlying 
set of the matroid as a finite set E C K@ (where K is any field), and independence 
mean linear independence. The rank of a set is the dimension of the subspace it 
generates, and flats are intersections of F with linear subspaces. We also say that 
E represents the matroid over the field K. 

Which matroids are representable over which fields? This question leads to a 
deep and rich theory, which is, alas, beyond the scope of this book. 

Many properties of sets of vectors used in this book are “matroidal” (depend 
only on the matroid induced by these vectors). For example, F is in general position 
if and only if it induces the matroid consisting of all sets of cardinality at most d. 
This matroid is called the uniform matroid of rank d. As a special case, we get the 
free matroid, in which every subset of F is independent. 4 


Example 19.22 (Graphic matroids). Let G = (V,E) be a multigraph, and 
assume for convenience that it is connected. Let M consist of those subsets of E 
that contain no cycles; in other words, they are edge-sets of those subgraphs of G 
that are forests. Then (F,M) is a matroid, called the graphic matroid induced by 
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G. Then rank of (F,M) is n—1, the bases of (EL, M) are the spanning trees of G, 
and the circuits of (£,M) are the cycles in G. 

Graphic matroids are all linear; in fact, they can be represented by vectors over 
an arbitrary field Kk. 4 


Let us describe a third class of matroids, which is not as well studied as the 
previous two, but is tied to the contents of this book, in particular to Section 


Example 19.23 (Algebraic matroids). Let KK be a subfield of a field 1, and 
let E C Ky bea finite set. Let M consist of those set X C EF that are algebraically 
independent over K. The (E,M) is a matroid, which we call an algebraic matroid 
over the field K. The transcendence degree of a set of numbers, used in Section 
is just the rank in this matroid. Some basic matroidal properties of algebraic 
independence have been used implicitly in that section. 4 


Let us describe an example of very unstructured matroids, which can serve as 
counterexamples of many conjectures motivated by the more structured examples 
above. 


Example 19.24 (Paving matroids). For a given rank r > 0 and underlying set 
FE, start with a family S of subsets of E such that the intersection of any two of 
them has at most r—2 elements. Let M consist of all subsets X C EF such that 
|X| <r, and |XNA| <r for every A € S. (In particular, every set of size r—1 or 
less belongs to M.) Then (£,M) is a matroid, called the paving matroid induced 
by S. 

The paving matroid of rank 4 on the set {1,...,8} induced by the family of 
5 sets {{1,2,3, 4}, {1, 2,5, 6}, {1, 2, 7, 8}, {3, 4,5, 6}, {3, 4,7, 8}}, called the Vamos 
matroid is an example of a matroid that is neither linear nor algebraic. 4 


We also need the following very simple example of a submodular setfunction. 


Example 19.25. Let H be a hypergraph. We define a setfunction on the subsets 
of E = E(H) by 
f(S) = |us| =| 4]. 
AES 
More generally, we can consider a family F of flats in a matroid (as the most 
important special case, a family of finite dimensional linear subspaces of a linear 
space), and define setfunction on the subsets of F by 


f(S) =r(US) (oe CF), 


This setfunction is trivially integer valued, monotone, and satisfies f(@) = 0. It 
is not hard to see that it is submodular, so it is a polymatroid. A polymatroid 
obtained from a family of linear subspaces is called linear. 4 


19.5.2. Duality. For a matroid (£,M), we construct its dual matroid 
(E,M*), in which a set X C E is independent if and only if E\X spans the 
whole space in (EZ, M) (in other words, it contains a basis). It is not hard to verify 
that this is indeed a matroid. It also follows that (M*)* = M. The bases of M* 
are the complements of the bases of M. The rank function r* in the dual matroid 
can be expressed by the formula 


r’(X) =r(B\X)4+|X|—r(£). 
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The construction of the Gale dual in Section{19.3.3]implies that if a matroid (E,.M) 
is representable over a given field, then so is (EF, M*). Note that while the Gale 
dual is not uniquely determined, its underlying matroid is. 

The dual of a graphic matroid is not graphic in general; in fact, it is graphic if 
and only if the graph is planar. If the graph is a planar map, then the dual of its 
matroid is the matroid of the dual map. 


19.5.3. Generic points and hyperplanes. The Generic Point Lemma is 
closely related to a purely matroid-theoretical result [Edmonds 1970]. One way 
to introduce this is to construct an extension of a polymatroid (FE, f) with a new 
element a, of which we think of (for the sake of a geometric motivation) as a generic 
k-dimensional “subspace” of an element b € E in the polymatroid. Here the word 
“generic” should mean that a “sticks out” of the span of any subset X C E as 
much as submodularity allows. In formula: 


(19.18) f(XU{a}) = min{ f(X) +k, f(XU {b}). 
It is not hard to very that this extended function is still a polymatroid on the 
subsets of EU{a}. In particular, we can extend (EF, f) by a point (element of rank 


1). Repeating this with all elements of E, and keeping track of the rank function, 
we get the following theorem. 


Theorem 19.26. If f : 2” — Z is a polymatroid, then 
My, ={X CV: |XNS|< f(S) for all SCV} 
is a matroid, whose rank function is given by 


r(X) = min (f(¥)+1X\¥\). 


Lemma|19.12|concerns the special case when the polymatroid is linear. In this 
setting, the content of the theorem is that a generic choice of representative points 
provides a linear representation of the matroid My. So Lemma|19.12|says less than 
Theorem [19.26]in the sense that it concerns only linearly representable submodular 
functions, but more in the sense that it gives a linear representation of the induced 
matroid. This geometric view of matroid-theoretical results was worked out by 
[Mason 1977]. 

Similarly, the Generic Hyperplane Theorem is closely related to a result about 


submodular functions [Edmonds 1970}: 


Theorem 19.27. If f : 2” > Z is a submodular setfunction that is nonnegative 
on nonempty sets, then 


f'(X) = min(f(X1) +--+ f(Xx)) 


(where {X1,...,X,} ranges over all partitions of X into nonempty sets) defines a 
submodular setfunction. 


Clearly f’(0 = 0. If f(@) > 0, then f’(X) = f(X) for every nonempty set; so 
the only interesting case is when f(@) < 0. As a result on submodular setfunctions, 
the last theorem says that f’ “fixes” this problem of f: it is the unique submodular 
setfunction that is nonnegative, f’(0) = 0, f’(X) < f(X) for X ¥ 0, and which 
majorizes every other submodular setfunction with these properties. 

What Lemma{19.20]adds to this result is that if g is a linear polymatroid defined 
by nonnull subspaces, and we define f = g—1, then f’ is a polymatroid realizable 
by the family of intersections of these subspaces with a transcendental hyperplane. 
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This result can be combined with Theorem [19.26 


Corollary 19.28. Let f : 2” > Z be a submodular setfunction that is positive on 
nonempty sets. Then 


* ={ACV: |ANX|< f(X)-1 forall X CV, X #9} 
is a matroid, whose rank function is given by 


r(X) =, min | [¥ol+(f(4)—1) +--+ (FV) D) 


seiles) 


where {Y0,..-,Yx} ranges over all partitions of X. 


Again, if f is linearly representable, then so is the matroid My. Starting with 
the free matroid on a set V, we get the graphic matroid induced by the complete 
graph on V. 


19.5.4. Matroid sum. We have come to a point where a matroid theoretical 
formulation of the results is not only be more general, but allows for a more elegant, 
more conceptual treatment. Given two matroids (E,M,) and (FE, M2) on a com- 
mon underlying set, we can construct the set system M,V My, = {X,UX2: X;€ 
M,;}, which we call the sum of the matroids (Z,Mj,) and (£, M2). 


Theorem 19.29. The set-system (E,M1VMg2) is a matroid. The rank function 
? of (E,MiV Me) can be expressed in terms of the rank functions r; of (E,M;) 
as follows: 

A(X) = maxycx(r(¥)+re(X\Y)) = minyex {ri(¥)+re(¥) +1X\ YI}. 


Note that this gives a min-max (NPMco-NP) characterization of this rank func- 
tion. 

It is straightforward to define the sum (F,Mi1V---VMg,) of k matroids 
(£,M,) on the same underlying set and generalize the above expressions for its 
rank function. Corollary [19.16] is the special case when we add up the same lin- 
ear matroid k times, and it generalizes to matroids in a straightforward manner 


(Exercise [19.14p. 


19.5.5. Matroids and rigidity. Consider a vector labeling u of the complete 
graph K,, in R¢, where we assume that n > d+1 to avoid some trivial complications. 
Stress-free sets of edges of K,, define a matroid on E(K,,), which we call the d- 
dimensional rigidity matroid of (K,,u). For an explicitly known representation u, 
we can do computations in this matroid using elementary linear algebra. 

It is not hard to see that for a full-dimensional vector labeling u, the rank of 
this matroid is dn— @t) (this specializes to our favorite value of 2n—3 for d = 2). 
A graph on V(K,,) is rigid in this realization if and only if its edges form a basis of 
this matroid. 

Now assume that u is generic; trivially, this leads to the richest matroid on 
E(kK,,) (with the largest family of independent sets), which we call the generic 
rigidity matroid on n elements. 

In the case d = 2, we can invoke matroid theoretical results through the fol- 
lowing argument. The setfunction |V(£)| (E C E(K,,)) is submodular (a special 
case of Example (19.25), and hence so is the setfunction f(£) = 2|V(E)|—3. This 
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setfunction is nonnegative on every nonempty set of edges, but negative for E = 0. 
By Theorem|}19.27| we can define a nonnegative setfunction f’ by the formula 


k 
7S min ) | f (Xi), 


where {X1,...,X,} ranges over all partitions of X into nonempty sets. We see 
that f’({x}) = 1 for every singleton set {x}, and so f’ is the rank function of a 
matroid, which is just the rigidity matroid. The graph G is generically rigid if and 
only if f’(E) = 2n—3, and hence we get the partition condition for generic rigidity. 

For higher d, we can also consider the submodular setfunction f(X) = 
d|V(X)|— Ce, and “fix” it similarly as for d = 2. This is more complicated 
than for d = 2, but the main problem is that, as we have seen, it does not capture 
the generic rigidity matroid; it is a freer matroid for d > 2 and large enough n 
(having more independent sets). 


19.5.6. Graphs on matroids. The problems treated in Chapter offer 
themselves to a generalization to matroids. We can label the nodes of a graph G 
by the elements of a matroid; it will be convenient to identify the nodes with their 
labels, so we consider a graph whose nodes are the elements of a matroid. So we 
consider pairs (G,M), where G is a finite graph and (V(G), M) is a matroid. 

All the quantities introduced in Chapter [18]can be defined. We define 7(G, M) 
as the minimum rank of a node-cover. We define a matching as a family of disjoint 
edges whose union is independent in the matroid, and v(G,M) as the maximum 
cardinality of a matching. 

Most of the results about covering and cover-critical graphs in Sections [18.2.1] 
and extend to this more general setting with essentially the same 
proofs. The most notable exception is Theorem [18.8{(b), whose proof uses more of 
the algebra of linear spaces than just dependence-independence. For cover-critical 
graphs on general matroids, only a weaker bound 


(19.19) mage) 


seems to be known. 

For matchings, the situation is worse: The problem of computing v(G,M) 
contains NP-hard problems for explicitly described matroids. To show a simple 
example, let G be a graph and k > w(G). We want to decide whether w(G) = k or 
w(G) <k. It is clear that this is an NP-hard task. 

Consider the graph H = GX Ko, and let F be the family of (2k)-cliques in H 
(this may be an empty family). We define an almost trivial graph on U = V(#) 
by connecting nodes (7,1) and (7,2) for every i € V(G). All the information we 
need about G goes into the matroid (U,M), which is the paving matroid of rank 
2k defined by the family F. (Every (2k)-clique in H is of the form BxV(K3), 
where B is a k-clique in H, so any two sets in F have at most 2k—2 elements in 
common.) Now v(G,M) = & if and only if w(G) = k. So to determine v(G, M) 
for this graph on this matroid is NP-hard. 

The matching problem for graphs on matroids can be solved in polynomial time 
when the matroid is linear, but not in general. One can go beyond the class of linear 
matroids: for example, it can be solved for algebraic matroids . 
To illustrate how the same geometric argument can lead to very different results 
when applied in different matroid models, let us state one version of this result. 
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Theorem 19.30. Let K, C Kz be two fields, and let uj,...,Un,V1,---,Un € Ka. 


Then the maximum cardinality of a subset {i1,...,in} C {1,...,n} such that 
Ui, Vizy+ ++; Win, Vi, are algebraically independent over Kk, is given by the formula 


min{ deg.,(K|K1) + | dese (fuse > £6 S} | K)|, 
j= 


where the minimum is taken over all field extensions K of Ky, and over all parti- 
tions {S),...,S,} of {1,...,n}. 


A larger class of matroids to which the min-max theorem about matchings can 


be extended was studied in |Bjorner—Lovasz 1987). However, the complexity of the 


matroid matching problem is still not completely understood. 


Exercise 19.1. Let v(#) denote the maximum number of elements in a linearly 
independent family F, and let nin(#) be the minimum dimension of a subspace 
intersecting every element of F in a nonzero vector. Prove that v(F) < nin(F), 
and show that equality does not hold in general. 

Exercise 19.2. Prove Lemma|[19.2 

Exercise 19.3. Construct a vector-labeling u of K4 such that 7(Ka4,u) = 3 but 
TNiin(FK4,u) = 2. 

Exercise 19.4. For each edge ij of a framework (G, v), let Lj; = lin{vi,v;}, and 
let Fa,y be the family of these subspaces. (a) Prove that v(Fa,v) = v(G,v). (b) 
Prove that if the vector labels are linearly independent, then nin(Fe,v) = T(G, v). 
(c) Prove that Tin(Fa,v) < 7(G,v) for all vector labels. 

Exercise 19.5. Let Ai,...,An be p-sets of points in R”, and let Bi,..., Bn be 
q-dimensional subspaces of R*.. Suppose that A;MB; = 0 for alli, but A;N B; 4 0 
for 1 <j (no condition on i > 7). Prove that n < (Pas 


Exercise 19.6. Let H be a family of 2-dimensional subspaces of a linear space, 
such that rk(H) = 2|H|—2 and rk(H\{A}) = 2|H|—2 for every A € H. 
(a) Prove that there is a unique partition H = {Hi,...,Hx} such that 
2\F|-2 if F=H, 
rk(F) = ¢ 2|F|—1_ if F contains exactly k—1 of the classes Hi, 
2|F\, otherwise. 


(b) Prove that if k > 3, then lin(H\H1i)N---Nlin(H\H.z) 4 0. 


Exercise 19.7. Let L be a algebraic subspace of R@ and J, any other subspace. 
(a) Prove that Kr4; C Ky; and Kray C Ky. (b) Prove that L/J is algebraic 
within J+. 

Exercise 19.8. Let L C R? and J C L be subspaces. Prove that J is transcenden- 
tal within L if and only if J has a basis obtained by taking r linear combinations 
of a standard basis of L with coefficients algebraically independent over Kz. 


Exercise 19.9. Derive Theorem |15.16]from Corollary |19.16 


Exercise 19.10. Prove that every polymatroid can be represented by a family 
of flats of a matroid as in Example|19.25 


Exercise 19.11. Let Ai,...,An be random events, and for X C [n], define 
f(X)=P( A Ai). 
ieX 


Prove that f is a submodular setfunction. 
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Exercise 19.12. Let ai,...,an be linearly independent vectors in R”. For X = 
{i, ee ste} < [n], let 
f(X) = Involk(P(ai,,.--,ai,)), 
where P(aj,,...,a:,) denotes the (k-dimensional) parallelotope spanned by the 


vectors aj,,...,a:,. Prove that f is a submodular setfunction. 


Exercise 19.13. Let A be an nxn positive definite matrix, and let Ax denote 
the submatrix formed by its rows and columns in X (X C [n]). Prove that 
Indet(Ax) is a submodular setfunction. 


Exercise 19.14. Let (E,M) be a matroid. Prove that E can be partitioned into 
k independent subsets if and only if 

kr(X) > |X| 
for every X C E. 
Exercise 19.15. Let (V,M) be a matroid and H C 2”, a family of r-sets. Define 
7TM(H) as the minimum rank of a subset X C V meeting every set in H. Suppose 
that H is critical in the sense that tu(H\{A}) < tm (H) for every A € H. Prove 


that 
|H| < pT (CH) 
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CHAPTER 20 


Concluding Thoughts 


We have seen a fairly large variety of geometric representations. Is there a 
way to fit these into a single theory? Perhaps not, considering the abundance of 
possibilities how the graph structure can be reflected in geometry. But while writing 
this book, more and more common ideas between this variety of representations 
have emerged. Let us describe some of these (admitting that they form a germ of 
a coherent theory at most). 


20.1. Non-degeneracy 


A common theme in various geometric representations is that imposing non- 
degeneracy conditions on the representation often makes it easier to analyze and 
therefore more useful (most often, by eliminating the possibility of numerical co- 
incidence). There are at least four types of non-degeneracy conditions that play a 
role in several types of geometric representations (and so they pop up in several 
chapters of this book). It is interesting that different non-degeneracy conditions can 
be imposed on one and the same representation model, leading to different combi- 
natorial conditions; we have seen this in connection with orthogonal representations 
as well as with rigidity and motions of frameworks. 


Faithfulness. This is perhaps the most natural non-degeneracy condition. As 
a simple example, we want to represent a graph so that adjacent nodes are at unit 
distance (recall Section (13.2.1). Such a representation is faithful, if nonadjacent 
nodes have distance different from 1. We have seen in Section that. faith- 
ful orthogonal representations behave quite differently from arbitrary orthogonal 
representations. This is a very natural but often rather weak condition. 


General position. Recall that a vector representation in R@ is in general po- 
sition if no d+1 of the points are contained in a hyperplane. This property of 
quite different geometric representations is often connected to the connectivity of 
the graph. In Section 3.3} we characterized graph connectivity by the general posi- 
tion of rubber band representations; in Section by the existence of orthogonal 
representations in general position; and in ee by the existence of glob- 
ally rigid bar-and-joint frameworks in general position. Orthogonal representations 
in general position were the key to several properties of the variety of orthogonal 
representations. A quantitative form of general position, “respecting the volume”, 
lead us to an efficient approximation algorithm for the bandwidth of a graph. 


Genericity. The most powerful nondegeneracy condition is that the represen- 
tation is generic, which means that the coordinates of node positions (sometimes 
other numerical data) are algebraically independent real numbers. This cannot 
happen, of course, for a unit distance representation of any graph with at least one 
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edge; but if we allow arbitrary prescribed distances instead of unit distances, then 
generic representations may exist, playing a central role in the study of rigidity 
properties and flexing of bar-and-joint frameworks (Sections and[15.5p. 

Using generic points and projections was a basic tool in the study of matching 
and covering in frameworks (Chapter [18). We did a general study of “genericity” 
in subspaces, and applied it in the combinatorics of subspaces (Sections and 
19.4). 

Genericity certainly avoids unwanted numerical coincidences between the data, 
but it may seem quite far from anything practical; transcendental numbers are 
almost impossible to explicitly describe and do computations with. But there is a 
very simple way out: choose the coordinates randomly, independently and uniformly 
from [0,1]. With probability 1, these coordinates will be algebraically independent. 
This seems to be of no help though, since to describe a random point of [0,1], 
we need infinitely many digits. Here is where general lemmas (see the Schwartz— 
Zippel Lemma{3.12) come to help, showing that if any substitution for the variables 
eliminates unwanted algebraic relations, then random choice from any sufficiently 
large finite set does so. 


Transversality. To be generic is a condition that is often too strong; this leads 
us to what is (I feel) the deepest nondegeneracy notion, transversality (closely re- 
lated to the Strong Arnold Property of matrices). This condition was introduced in 
a general context in Section [10.5} recall that it means that, writing the constraints 
on the representation as algebraic equations, the corresponding hypersurfaces in- 
tersect transversally. Thus the existence and properties of the representation (the 
intersection point of these hypersurfaces) are not just accidental, but forced by 
some more fundamental structure. If a solution of the system is transversal, and 
we perturb each equation by a small amount, then we get another solvable system 
with a solution nearby. 

In graph theory, this condition was first used by Colin de Verdiére, in his 
beautiful work on the “Colin de Verdiére number” (Chapter [16) and on a version 
of tree-width (algebraic width, see Section But in a sense, in the study of 
rigidity of bar-and-joint frameworks, the transversality condition played a central 
role ever since the study of such frameworks began in the 19‘ century: for such a 
framework, it corresponds to stress-freeness, a central notion in Chapters and 


[15] 


Can we tell what kind of combinatorial properties are related to these different 
kinds of nondegeneracy conditions? Probably not in general, but a couple of points 
are worth mentioning. 

As we have seen, representations in general position are often related to connec- 
tivity properties of the graph. Connectivity properties of graphs have interesting 
and somewhat surprising connections with geometric representations, and some- 
times, through those, with topological properties. Various versions of the connec- 
tivity property with respect to halfspaces (Section}16.5) are a good example. 

Transversality was instrumental in proving minor-monotonicity of the Colin de 
Verdiére number and of algebraic width. Minor monotonicity is a basic feature of 
a graph property or a graph parameter, and its connection with transversality is, I 
feel, worth of further exploration. 
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Genericity was used most substantially in Chapters [15] and [19] and has strong 
matroid-theoretic connections. The interplay between the two matroids involved 
(the linear matroid of the vector representation and the algebraic matroid of the 
coordinates) is, I feel, not completely understood. 


20.2. Duality 


Many related but different forms of duality have played a role in this book. 

e Gale duality, introduced in Section [19.4] is perhaps the simplest form of du- 
ality. This construction is known under many aliases (dual chain group in matroid 
theory, dual code in coding theory, Gale diagram in the theory of hyperplane ar- 
rangements). We can carry out this construction for any vector representation of 
a graph G, to get a dual vector representation. In some cases, this gives interest- 
ing new constructions; for example, from cover-preserving representations we get 
independence-preserving representations in Chapter [18| 

e Matroid duality generalizes Gale duality, as well as the duality of planar 
maps. 

e Linear programming duality is a basic construction in linear programming 
and through this, in much of applied mathematics. 

e Semidefinite duality is a generalization of linear programming duality, treated 
in detail in Chapter [13| 

e Polarity is a duality construction in convex geometry, which is closely re- 
lated to linear programming duality. Related duality notions are the blocking and 
antiblocking relations for polyhedra in the nonnegative orthant (Section [C.3). Po- 
larity of 3-polytopes is a metric form of planar graph duality. A duality between 
resistance and energy (as can be observed in Section [4.2.1) is related to blocking 
duality. 

e The Weak Perfect Graph Theorem, asserting that the complement of a per- 
fect graph is perfect, expresses a certain duality. This can be extended to a full- 
blown duality (anti-blocking relation) between the TSTAB bodies of a graph and 
its complement, as described in Section [11.4] This duality goes back, ultimately, 
to semidefinite duality, so it is quite interesting that it connects complementation 
of graphs as well as to antiblocking of polytopes. 

e Self-duality in various situations is also worth mentioning. We have met self- 
polar polytopes (Example {11.16} and self-complementary graphs (Corollary 11.7). 
Self-duality of nonnegative orthant and of the positive semidefinite cone are the 
key ingredients behind the duality theory of linear and semidefinite programming, 
respectively. Discrete holomorphic forms are the self-dual subspace of the space 
of circulations on a map (those circulations which define a circulation on the dual 
graph). 

Not all duality-type phenomena are fully understood. The Gale dual is only 
determined up to an affine transformation; for geometric representations with met- 
ric properties (which is the majority), this dualization does not seem to be useful. 
Yet in some cases more than the basic linear structure is dualized. We don’t have 
a general theory for this, but let us briefly describe two examples. 

We have mentioned a complete duality of the TSTAB bodies of a graph and 
its complement. This is a duality between the profiles of the orthogonal representa- 
tions. However, there is a duality-type relationship between the optimal orthogonal 
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representations themselves. Consider an orthogonal representation (u, c) of a graph 
G that is optimal with respect to a weighting w, and construct from it the following 
matrix, adding the handle as a last column: 


U = ((e'us)uy,..., (cu) Un, c) 


Also construct a similar matrix from an optimal dual representation (with a little 
twist in the sign of the last column): 


oe ((d'vi)vi, ath (d'vi)vn, —d). 


Then expresses that every row of U is orthogonal to every row of W. If, in 
addition, u is d-dimensional and v is r-dimensional with d+r = n+1, then W is 
a Gale dual of U. These matrices have the additional property that (disregarding 
the last column) any two columns are orthogonal either in U or in W. Is there a 
deeper duality principle behind this observation? 

Turning to our second example, Theorems [16.15] and [16.16] seem to indicate a 
duality between the Colin de Verdére numbers of outerplanar/planar graphs and 
their complements. This is thanks to two almost-dual geometric representations, 
the nullspace representation of the graph and the l-engaged representation of its 
complement. But while the 1-engaged representation has strong metric properties, 
it is unclear how to impose those on the nullspace representation, and the theory 
is incomplete. 


20.3. Very large graphs and their limits 


It is more and more apparent that the notion of a network is fundamental for 
many branches of science. The networks one studies are often very large, even huge 
with billions of nodes, and investigating such large networks raises novel problems 
and takes novel tools. For example, the graph may not be fully known or only 
be inspectable by sampling (see the Introduction of for a detailed 
description). 

Based on the applications of geometric representations in this book, one would 
like to see how these geometric tools have to be modified to be applicable to very 
large graphs. As an illustration, we may want to compute a useful representation, 
say an optimal orthogonal representation in the sense of Section [11.1] Everything 
we proved about such representations remains valid independently of how large the 
graph is, but to compute the representation we need to know the whole graph, and 
even then, its size may be prohibitive. What can be computed from the limited 
information available (for example, from sampling)? 

Very large graphs are often better understandable if we introduce limit objects 
for sequences of graphs whose size goes to infinity, while becoming more and more 
similar in an appropriate sense. Random graphs with a fixed edge density are simple 
examples. We want to generalize geometric representation notions and results to 
these limit objects. The expectation is that if we succeed in generalizing a graph 
parameter or other graph-theoretic concept to the limiting case, then this indicates 
its tractability for very large finite graphs. 

There are several limit notions for graphs, and the possibilities of extending 
geometric representation theory is quite different for different types of graphs and 
limit notions. We discuss three of these. I tried to collect the rather sporadic results 
about the types of representations considered in this book, and also add some 
speculation about the possibilities for some of our main representation concepts. 


20.3. VERY LARGE GRAPHS AND THEIR LIMITS 391 


Dense limit. Convergence of a dense graph sequences was introduced in 
Borgs et al. 2008). The limit object for a sequence of dense graphs has several 
avatars. Geometric representations may mean different things even for equivalent 
models. 


e A graphon is a symmetric, measurable function [0,1]? — [0,1] 
‘Lovasz—Szegedy 2006]. This is the most commonly used description of the limit 
object. Various basic notions of graph theory have been extended to graphons: sub- 
graph densities, connectivity, spectrum, matchings, coloring etc. (See 
and also and the references there.) 

Some geometric representations have nice extensions to graphons; for example, 
the A-squared representation introduced in Section|9.3}can be defined for graphons, 
as a vector-labeling from a Hilbert space (or from some other Lebesgue space). In 
particular, the 2-neighborhood distance can be defined, and regularity partitions 
can be generalized. Perhaps most importantly, several crucial compactness proper- 
ties of graphons as well as of the space of graphons can be proved with the help of 
this representation. 

On the other hand, it seems to be difficult to generalize orthogonal representa- 
tions to graphons. Can we generalize at least the graph parameter 0(G)? One has 
to be careful about the normalization, to guarantee that the values converge for an 
appropriately convergent graph sequence. Proposition [11.4] and inequality 
suggest that 0(G)/./n may be an interesting normalization. Let me raise a spe- 
cific question: For every graphon W and every n, we can sample from the graphon 
and obtain a “W-random graph” G(n,W). Is it true that for every graphon W, 
0(G(n, W))/./n converges to a limit almost surely as n — oo (the limit may be 0 
or oo)? 

e Another equivalent (cryptomorphic) form of limits of dense graph sequences 
is a local random graph model on a countable set, which is a probability distribution 
on (infinite) graphs on node set N, which is invariant under permutations of the 
nodes, and has the “locality” property that observing two disjoint finite subsets of 
the nodes (two “windows” ), the subgraphs induced by them are independent (in the 
probabilistic sense). This version has the advantage that the node set is countable, 
so (for example) we can generate independent random weights for the nodes, which 
(among others) allows us to define a random ordering of the nodes. Unfortunately, 
I don’t know of any work exploiting this equivalent but different model. 


e A third equivalent description of a limit object is a graph parameter expressing 
the limiting densities of finite graphs in a convergent sequence of dense graphs. Such 
parameters can be characterized by linear inequalities and semidefiniteness condi- 
tions. This formulation is analogous (in a sense generalizes) the theory of moments 
of random variables (see and 
for details). Since moments of random variables have many geometric connections, 
it is likely that such connections will be discovered for these graph parameters as 
well. 


Scaling limit. Turning to the case of sparse graphs (which are much more 
relevant for applications), the first thing to note is that these graphs may be in- 
teresting on various scales. The simplest way of defining these scales is the graph 
distance. Since for sparse graphs the diameter may be very large, we may want to 
scale the distances so that we keep the diameter of the graph fixed, or we may want 
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to keep the edge lengths fixed (and not worry about the diameter). A limit object 
in the first sense is called the scaling limit; this preserves global properties of the 
graphs. A limit object in the second sense is called the local limit; this preserves 
local properties of the graphs. As a very simple but useful example, the limit of 
an nxn grid, as n + oo, may be a full square in the plane, but also an infinite 
grid—depending on what properties we are interested in. 

Scaling limit is a rich and extensively studied limit notion for graphs, and we 
encountered this notion several times, mostly in Chapter|8| but also (not under this 
name) in Section [5.3] where conformal mappings were obtained as scaling limits of 
coin representations. 

Generally, we talk about scaling limits when the graph already has a geometric 
representation. This is often rather simple (say, a lattice domain), and it is some 
other feature whose limiting behavior is interesting. To define and describe scaling 
limits without a predefined geometry (say, for bounded degree graphs which are 
not planar or almost planar) would be very interesting. 


Local limit. Local limit theory has been developed in the extreme case, when 
the degrees are universally bounded by a number D (some extensions to the case 
when the degrees are “almost bounded” are also available). There are again different 
ways of describing limit objects; these are not completely equivalent, but closely 
related. 


e Involution-invariant random. rooted graphs (also called unimodular networks) 
is the original limit concept introduced in [Benjamini-Schramm 2001). This is a 
probability distribution on bounded-degree connected graphs with a specified root, 
under a measure-preservation condition. 

In this latter setting one can define random countable planar graphs, and use an 
appropriate generalization of the coin representation (Theorem|5.1) to prove, among 
others, that the random walk on a random countable planar graph is recurrent with 
probability one. Appropriate generalizations of the square tiling theorems (Chapter 
(6) have applications in a similar fashion [Benjamini-Schramm 1996a}. 

Dual orthogonal representations in a finite dimension d > D exist in this setting, 
and G-orthogonalization makes sense. Results in Section[10.3]remain valid mutatis 
mutandis, but how many new problems and phenomena the infinite setting provides 
remains an unexplored question. 


e A graphing is a bounded-degree graph on [0, 1], whose edge-set is a Borel set 
in the unit square, and a certain measure-preserving condition is satisfied. An easy 
example is a circle, where diagonals of a fixed length are the edges. 

The underlying space of a graphing is often a manifold, and in this case a 
dual orthogonal representation means some sort of a vector field. Dual orthogonal 
representations in dimension d > D exist on every connected component, but it 
is unclear whether they can be chosen in a measurable (let alone continuous or 
smooth) way. Would such a representation tell us anything interesting about the 
combinatorial structure of the graphing? 


20.4. Graphs and geometry 


The interplay between graph problems and geometry has many forms. Often 
the graph we want to study comes together with a predefined geometric represen- 
tation, and the issue is to test certain mathematical and physical properties of the 
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resulting structure, or compute some parameters that connect combinatorial and 
geometric structure. Typical of this problem setting is the rigidity of bar-and-joint 
frameworks. Some basic graph theoretic questions have interesting and natural 
extensions to this situation, as in Chapter [18] where instead of asking for the min- 
imum number of nodes covering all edges, we asked for the minimum rank of such 
a set. Besides raising interesting questions, this extension of the problem was es- 
sential for the solution of a basic problem of classifying cover-critical graphs in the 
original (geometry-free) setup. 

This leads us to the really interesting combination of geometric and combina- 
torial ideas: we start with just a graph and a purely combinatorial problem. We 
try to find a way to represent a graph in a “good” form, helping to find the answer 
to the combinatorial question. Visualizing the structure of the graph this way may 
in itself be very useful, but the most interesting cases are when a proof uses this 
geometric structure in an essential way. Determining the Shannon capacity of the 
pentagon (as described in the Introduction) is a good example of such a proof. 

As it is to be expected in mathematics, the two different starting aspects above 
cannot be strictly separated. For example, Tutte’s rubber band method produces 
a nice planar drawing for planar graphs, but then it can be applied in connectivity 
algorithms, and the basic physical notion it leads to, namely the notion of a stress, 
is fundamental in rigidity theory. 

These connections with physics are classical, but new connections seem to 
emerge, as discussed Chapters |8} and The connection with quantum physics 
is particularly tantalizing. After all, in quantum physics one assigns vectors in a 
linear space to the states of a physical system, even if these states form a discrete 
set; and the mathematical methods discussed in this book do the same. I think 
that the material in the last chapter is only the tip of the iceberg. 

The boundaries of geometry cannot be sharply drawn, and further connections 
between graph theory and other areas of geometric nature could have been included 
in our book: 


e It is difficult to draw a border between geometry and linear algebra, and 
there are many applications of linear algebra in graph theory and other areas of 
combinatorics, which can now be considered classical, and are not treated in this 
book. The lecture notes provide an excellent introduction to 
this method. 


e One highly developed interface between geometry and combinatorial questions 
(mostly combinatorial optimization) is polyhedral combinatorics. Since thousands 
of pages have been written about this topic, I decided not to treat it in this book 
as a topic on its own right. (Of course, some of its basic results I had to use in 


the presentation of other topics.) I refer to [Schrijver 2003] for a comprehensive 


treatment of polyhedral combinatorics. 


e The applications of combinatorial and algebraic topology in the solution of 
combinatorial problems also has its successes, but it is a method with a quite 
different flavor, and I have not treated it in this book. gives a nice 
introduction to this area. 


e In this book, we considered geometry over the real field (with some excursions 
to the complex numbers and a brief excursion to finite fields in Section |3.3). One 
could expect that geometry over finite fields, while perhaps somewhat less expressive 
visually, is even more closely tied to combinatorial problems. There are indeed many 
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connections, from error-correcting codes to expander graphs to quasirandom graphs 
to Ramsey theory. 

I am certain that many new results of this nature will be obtained in the future 
(or are already in the literature, sometimes in a quite different disguise). Whether 
these will be collected and combined in another monograph, or integrated into 
science through some other platform provided by the fast changing technology of 
communication, I cannot predict. But the beauty of nontrivial connections between 
combinatorics, geometry, algebra and physics will remain here to inspire research. 


APPENDIX A 


Linear Algebra 


A.1. Vectors, matrices and vector labelings 


Usually we define vectors as elements of R@ for some nonnegative integer d, and 
we think of them as column vectors (unless said otherwise). But it will be useful to 
consider, for a finite set V, every map V > R as a vector. These vectors form the 
linear space RY. If we label the elements of V by 1,...,n, then RY is isomorphic 
R” in a trivial way. Using the notation [n] = {1,...,n}, the spaces R!"] and R” 
are identical. 

Similarly, matrices can mean any map A: UxV — R, where U and V are 
finite sets; we call this a U x V matrix, and the space of all such matrices will be 
denoted by RY*”. We also say that the rows of A are indexed by the elements of 
U, and the columns, by elements of V. 

A UxV matrix A can also be thought of asa mapa: V — RY. In this way, it 
corresponds to a vector labeling of the set V, assigning to i € V the vector a; € RY. 
We also write (a; : « € V) for such a matrix, and define the rank of the set of 
labeling vectors as rk{a; : i © V} =rk(A). 

A basis of a set of vectors S C RY is a subset of S that is linearly independent 
and spans the same linear subspace as S. A column basis of a matrix A € RU*” is 
a basis of the set of its columns. If we think of A as a vector labeling a: V > RY, 
then it will be convenient to call a subset X C V a basis fora if {aj;: iE X}isa 
column basis of A. 

The all-1 vector in dimension d is denoted by lg. The dxd identity matrix 
is denoted by Ig, the all-one matrix of the same dimensions, by Jg. The vector 
lg € RY and the matrices Ig, Js € R°** are defined similarly for every finite set 
S. In the case when the coordinates are indexed by the set V (usually the node set 
of the graph G) we omit the subscript: 1 ¢ RY and J, J ¢ RY*Y. If A is some 
logical assertion, then 1(A) = 1 if A is true and 1(A) = 0 otherwise. The vector 
lg is the incidence vector of the set S: (1g); = 1(i € S) (where the universe is 
understood). The vectors e; € R@ having 1 in the i-th position and 0 elsewhere 
form the standard basis of R¢. The standard basis of RY can be defined analogously. 

For two subsets A,B C R%, we define A+B = {at+b: a€ A,b € B}. The 
set A—B is defined analogously. For a subset A C R?@, we denote by A+ the set of 
vectors orthogonal to all vectors in A. We need, for linear subspaces A and B, the 
elementary relations (A+)+ = A and (A+B)+ = A+nBt. 

For v € R?%, we denote by |v| its Euclidean norm, and by v° = (1/|v|)v, the 
unit vector in the direction of v (this is only defined for v 4 0). 
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A.2. Eigenvalues 


Let A be an nxn real matrix. An eigenvector of A is a nonzero vector such 
that Az is parallel to x; in other words, Ax = Ax for some real or complex number 
A. This number J is called the eigenvalue of A belonging to eigenvector v. Clearly 
X is an eigenvalue if and only if the matrix A—XI is singular, equivalently, if and 
only if det(A— AI) = 0. This is an algebraic equation of degree n for A, and hence 
has n roots (with multiplicity). 

The trace of the square matrix A = (A,;) is defined as 


The trace of A is equal to the sum of the eigenvalues of A, each taken with the 

same multiplicity as it occurs among the roots of the equation det(A— AJ) = 0. 
We can think of nxm matrices as vectors with nm coordinates. For two 

matrices of the same shape, A, B € R"*™, we define their inner product by 


A-B= tr(A'B) = Se ee 


i=1 j=1 
This should not be confused with the matrix product, but it is related: 
A-B=tr(A'B). 


If the matrix A is symmetric, then its eigenvalues and eigenvectors are par- 
ticularly well behaved. All the eigenvalues are real, and there is an orthonormal 
basis v1,...,Un of the space consisting of eigenvectors of A. Let Ay >--- > An be 
the corresponding eigenvalues, then they are precisely the roots of det(A— AI) = 0. 
The matrix A can be written as 


n 
A= ) AiV4V;« 
i=l 


Another way of saying this is that every symmetric matrix can be written as U'DU, 
where U is an orthogonal matrix and D is a diagonal matrix. The eigenvalues of A 
are just the diagonal entries of D. 

The largest eigenvalue of a symmetric matrix can be expressed by the Rayleigh 
quotient: 

xl Ax 
A, = max —~, 
zeRn\0 |x|? 


and every maximizer is an eigenvector belonging to A;. This formula generalizes: 


if we know the first i—1 eigenvectors v),...,v;—-1, then the 7-th largest eigenvalue 
can be expressed by the i-th Rayleigh quotient: 
an 
x’ Ax 
A; = max —, 
|x|? 


where the maximization is over all vectors x € R” such that x 4 0 but via = 0 for 
j=1,...,i—1. Again, every maximizer is a corresponding eigenvector. 

A principal minor of A is a submatrix B obtained by deleting some rows and the 
corresponding columns. The following important theorem was proved by Cauchy. 
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Theorem A.1 (Interlacing eigenvalues). Let A be annxn symmetric matrix 
with eigenvalues 41 >--+ > Ay. Let B be an (n—k) x (n—k) principal minor of A 
with eigenvalues [ty > +++ > bn—p~- Then 


N= pa = Nth: 


We conclude this little overview with a further basic fact about eigenvectors of 
nonnegative matrices. The following classical theorem concerns the largest eigen- 
values and the eigenvectors corresponding to it. 


Theorem A.2 (Perron-Frobenius). [fan nxn matrix has nonnegative entries, 
then it has a nonnegative real eigenvalue \ which has maximum absolute value 
among all eigenvalues. This eigenvalue X has a nonnegative real eigenvector. If, in 
addition, the matriz has no block-triangular decomposition (i.e., it does not contain 
akx(n—k) block of 0-s disjoint from the diagonal), then X has multiplicity 1 and 
the corresponding eigenvector is positive. 


For us, the case of symmetric matrices is important; in this case the fact that 
the eigenvalue is real is trivial, but the information about its multiplicity and the 
signature of the corresponding eigenvector is used throughout the book. 


A.3. Semidefinite matrices 


A symmetric nxn matrix A is called positive semidefinite, if all of its eigen- 
values are nonnegative. This property is denoted by A > 0. The matrix is positive 
definite, if all of its eigenvalues are positive. 

There are many equivalent ways of defining positive semidefinite matrices, some 
of which are summarized in the Proposition below. We need a definition: the Gram 
matrit of n vectors U1,...,Un € R@ is the matrix Gram(u) = Gram(u1,...,Un) = 
(ip )P jaa where a;; = uj u;. If U is the matrix with columns uj,...,u,, then 
A=U'U. The matrix A does not determine the vectors u; uniquely, but almost: 
since the inner products of the vectors u; are determined, so are their lengths and 
mutual distances, and hence two representations of the same matrix differ only in 
an isometric linear transformation of the space. 


Proposition A.3. For a real symmetric nxn matrix A, the following are equiva- 
lent: 

(i) A is positive semidefinite; 

(ii) the quadratic form x7 Ax is nonnegative for every x € R"; 


(iii) A can be written as the Gram matrix of vectors u,,...,Un € R¢ for some 
d. The smallest dimension d from where the vectors u; can be chosen is the rank 


of A. 


(iv) A is a nonnegative linear combination of matrices of the type xx"; 


(v) The determinant of every principal minor of A is nonnegative. 


It follows that the diagonal entries of any positive semidefinite matrix are non- 
negative, and it is not hard to work out the case of equality: all entries in a row or 
column with a 0 diagonal entry are 0 as well. In particular, the trace of a positive 
semidefinite matrix A is nonnegative, and tr(A) = 0 if and only if A = 0. 
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Even though above we have described a large collection of equivalent neces- 
sary and sufficient conditions for semidefiniteness, simpler sufficient conditions are 
useful. One of these is being diagonally dominant: If a real symmetric matrix 
AER”™” satisfies aj; > >> ji |a;;| for every row, then A is positive semidefinite. 

The sum of two positive semidefinite matrices is again positive semidefinite (this 
follows e.g. from (ii) again). The simplest positive semidefinite matrices are of the 
form aa! for some vector a. These matrices are precisely the positive semidefinite 
matrices of rank 1. Property (iv) above shows that every positive semidefinite 
matrix can be written as a sum of rank-1 positive semidefinite matrices. 

The product of two positive semidefinite matrices A and B is not even sym- 
metric in general (and so it is not positive semidefinite); but the following can still 
be claimed about the product: 

If A and B are positive semidefinite matrices, then A-B = tr(AB) > 0, and 
equality holds if and only if AB = 0. A matrix A is positive semidefinite if and 
only if A-B > 0 for every positive semidefinite matrix B. 

Property (v) above provides a way to check whether a given matrix is positive 
semidefinite. This works well for small matrices, but it becomes inefficient very 
soon, since there are many symmetric minors to check. An efficient method to test 
if a symmetric matrix A is positive semidefinite is the following algorithm: 

Carry out Gaussian elimination on A, pivoting always on diagonal entries. If 
you ever find a negative diagonal entry, or a zero diagonal entry whose row contains 
a nonzero, stop: the matriz is not positive semidefinite. If you obtain an all-zero 
matrix (or eliminate the whole matrix), stop: the matrix is positive semidefinite. 

If this simple algorithm finds that A is not positive semidefinite, it also provides 
a certificate in the form of a vector v with v'Av < 0. 


Mo 


FIGURE A.1. The semidefinite cone for dimension 2. 


Positive semidefinite matrices have some important properties in terms of the 
geometry of the space R"*". The fact that the sum of two positive semidefinite 
matrices is again positive semidefinite (together with the trivial fact that every 
positive scalar multiple of a positive semidefinite matrix is positive semidefinite), 
translates into the geometric statement that the set of all positive semidefinite 
matrices forms a convex closed pointed cone P in R"*” with vertex 0. 

This cone P is important, but its structure is quite nontrivial. In particular, it 
is not polyhedral for n > 2; for n = 2 it is a nice rotational cone (Figure [A.1} the 
fourth coordinate x21, which is always equal to 212 by symmetry, is suppressed). For 
n > 3 the situation becomes more complicated, because P is neither polyhedral nor 
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smooth: any matrix of rank less than n—1 is on the boundary, but the boundary 
is not differentiable at such points. 


A.4. Geometric spaces 


A.4.1. Linear, affine and projective spaces. While from an algebraic 
point of view linear spaces are most basic, in a geometric setting it is often more 
convenient to use (real) affine and projective spaces. An affine space is simply a 
linear space where the location of the origin is forgotten. To be precise, all we need 
to do is to define an affine subspace A of a linear space R@ as a translation of a 
linear subspace L C R¢ by a vector a: A = L+a. Two affine subspaces of the same 
dimension are parallel, if they are translates of the same linear subspace. 

Projective spaces can be defined in two ways. The shortest way to construct a 
projective space P?~! from a linear space R? is by deleting the origin and identifying 
parallel vectors. Projective subspaces are the images of linear subspaces under this 
identification. An important point to remember is that the dimension of P¢~! is 
one less than the dimension of R¢. In particular, a projective plane is obtained 
from a linear 3-space this way. 

If we consider coordinates, then a point in the projective (d—1)-space can be 
represented by a nonzero vector in R?, whose entries are called its homogeneous 
coordinates. We have to keep in mind that two vectors which are nonzero scalar 
multiples of each other represent the same projective point. Each projective (d—2)- 
dimensional hyperplane corresponds to a (d—1)-dimensional linear subspace. This 
can be described by an equation a'x = 0. The entries of the vector a are the 
homogeneous coordinates of the hyperplane. From this description it is clear that 
we can interchange the roles of points and hyperplanes in projective spaces. 

Another way of constructing a projective space is to start with an affine space, 
and extend it with ideal points, where an ideal point is created for each maximal 
family of parallel lines. The ideal point becomes an element of each line in this par- 
allel family. The set of all ideal points is the :deal hyperplane. Every affine subspace 
is extended to a projective subspace by including the ideal points of lines contained 
in it; projective subspaces constructed this way are called ordinary. Additional ideal 
projective subspaces are formed by the ideal points added to an affine subspace; in 
other words, ideal subspaces are the intersections of ordinary subspaces with the 
idea hyperplane. Extension with ideal points does not change the dimension of an 
affine space. 

In the first construction of projective spaces, it is clear that all subspaces of 
the same dimension are “alike”, i.e., they can be transformed into each other by 
linear transformations of V. So even if the second construction seems to distinguish 
between ordinary and ideal subspaces, this is just an artifact of the construction 
method. 

For this book, the significance of these constructions is that they allow us 
to represent configurations of linear subspaces of a d-dimensional linear space as 
configurations of subspaces of a (d—1)-dimensional projective space (Figure [A.2| 
which we can visualize in a (d—1)-dimensional affine space (where we either don’t 
need to think about ideal points, or visualize them as “infinitely distant” ). 


A.4.2. Cross ratio. The cross ratio (vi : v2 : v3 : v4) of four vectors 
V1, V2, V3, V4 in a 2-dimensional subspace of a linear space, no two of which are par- 
allel, is defined as follows: we can write v3 = A,v1+Agvg and v4 = M1 V1 +pe2Ve, 
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FicurE A.2. A family of 2-dimensional linear subspaces in 3- 
space, and their picture as lines in the projective plane. 


and define 

(v1 2V2:V3: va) = (Azpi2)/(Agtn)- 
This number is invariant under scaling of the vectors v; by nonzero real numbers, 
which implies that the cross-ratio can be defined for any four distinct collinear 
points in a projective space. Under duality, cross-ratio can be defined for any four 
distinct hyperplanes sharing a hyperline. 

A much more significant property of the cross-ratio of vectors is that it is invari- 
ant under linear transformations of the linear space (equivalently, under projective 
transformations of the projective space). In particular, projecting a line @ in the 
projective space onto another line @ from any point not on either one of them, the 
cross ratio of any four points is preserved. 

Cross ratios are also important in circle geometry. For four points u,v, w, z in 
the complex plane, we can define the complex cross ratio 


(w—u)(z—v) 
(w—v)(z—u) 


This quantity is real if and only of the four points are on a circle or on a line. It is 
invariant under Mébius transformations of the complex plane (see Section|C.4.1). 


(u:viw:z)= 


A.4.3. Rank, dimension and hull. One potentially confusing thing about 
these different spaces is that notions like “hull” and “rank” mean different con- 
structions. In each case, the “hull” of a set is the smallest subspace containing it, 
a set of points is “independent”, if no element of it is contained in the hull of the 
others, and the “rank” of a set is the maximum size of an independent subset of it. 
Specifically, we define its linear hull of a subset S C R? as 


k 
lin(S) = {So nia: :a,E€S,7,ER, ke n}, 


i=1 


and its affine hull as 


k 
afi(S) = {) mai: ai €S,0 ER, Doi =1, ken}. 


i=1 i 
The rank rk(S) = dim(lin(S)) is just the rank of the matrix A whose columns are 


the vectors in S. We define the affine rank rkag(S) is the rank of the matrix ie 
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We define the convex hull: of S C R® by 


k 
conv(S) = {So nai : a, € S, 2; ER, x; > 0, Soa; =l1,ke N}. 
i=l i 
There are some relations between these notions, like 
conv(S') C aff(S) Clin(S), and aff(S) = S+lin(S—S). 


For a family F of subsets of R%, we use the notation 


rk(F) =rk(UF) =rk( LZ). 
LEF 
If J C R? is a subspace, then for every vector a, we denote by a/J its orthogonal 
projection onto the orthogonal complement J+ of J. For any subset X C R4%, the 
set X/J is defined analogously. 

Generalizing linear independence of vectors, we will call a family {Z,... LD} 
of non-null linear subspaces of R? linearly independent, if every set of nonzero 
vectors £1,...,%n, where x7; € L;, is linearly independent. Two subspaces are 
linearly independent if any only if their intersection is the zero subspace. There are 
equivalent definitions, whose equivalence is easy to prove, but it is useful to collect 
them in a Proposition. 


Proposition A.4. For a family £L = {I1,...Lm} of non-null linear subspaces of 
R¢, the following are equivalent: 

(a) {L1,...Lm} are linearly independent. 
b) Every L; intersects lin(L,U---UL;_1) in the zero subspace (i = 2,...,m). 
c) Whenever x; € Ly and 41 +...&m =0, then a, =---=%m =0. 
d) dim(LZ; +---+L,) = dim(L,)+---+dim(L,,). 
e) For any two indices 1<i< j<m, we have 


lin(L\ {Li}) Mlin(L\ {Ly}) = lin(L\ {Li, L3}) 


In the language of projective spaces, we say that a set of projective subspaces is 
independent, if the corresponding linear subspaces are linearly independent. Propo- 
sition can be reformulated: for example, a family of subspaces in a projective 
space is independent if and only if each of them is disjoint from the (projective) 
subspace spanned by the others. If the family consists of k lines, then they are 
independent if and only if they span a (2k—1)-dimensional subspace. 


( 
( 
( 
( 


A.5. Exterior algebra 


A.5.1. Cross product. The following construction is probably familiar from 
physics to most readers. For a,b € R?, we define their cross product as the vector 
(A.1) axb = |a| |b] singu, 
where @ is the angle between a and b (0 < @ < 7), and u is a unit vector in 
R® orthogonal to the plane of a and b, so that the triple (a,b, u) is right-handed 
(positively oriented). The definition of u is ambiguous if a and b are parallel, but 
then sin ¢ = 0, so the cross product is 0 anyway. The length of the cross product 
gives the area of the parallelogram spanned by a and b. 

The cross product is distributive with respect to linear combination of vectors, 
it is anticommutative: ax b = —bxa, and axb = 0 if and only if a and b are 
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parallel. The cross product is not associative; instead, it satisfies the Expansion 
Identity 


(A.2) (ax b) xc = (a'c)b—(b'c)a, 

which implies the Jacobi Identity 

(A.3) (ax b) xce+(bxc)xa+(cxa)xb=0. 
Another useful replacement for the associativity is the following. 
(A.4) (ax b)'c = a'(b xc) = det(a, b,c) 


(here (a,b,c) is the 3 3 matrix with columns a, b and c). 

Sometimes we use the cross product in the special case when the vectors lie in 
a fixed plane Hl. Let k be a unit vector normal to II, then ax b is Ak, where A 
is the signed area of the parallelogram spanned by a and b (this means that A is 
positive if and only if a positive rotation takes the direction of a to the direction 
of b, when viewed from the direction of k). In this case all the information about 
ax b is contained in this scalar A, which in tensor algebra will be denoted by aAb 
(see below). 


A.5.2. Exterior product. A couple of times we need the more general con- 
struction of exterior product. We need to define a k-tensor over the vector space 
V =R¢asamapT: V* > R that is linear in each variable. This is more restric- 
tive than the usual definition, but it will suffice for our purposes. The simplification 
means that we identify V with the space V* of linear functionals on V: a vector 
y € R@ acts on R? by x4 y'x. All k-tensors form a linear space of dimension d*. 

A tensor is antisymmetric, if interchanging two variables multiplies the func- 
tion value by —1. Perhaps the most important example is the determinant 
det(vi,...,Va) (v1,---,Va € R¢). All antisymmetric k-tensors form a linear space 
APV of dimension (@). In particular, there are no nontrivial antisymmetric k- 
tensors for k > d, and antisymmetric d-tensors are scalar valued and just constant 
multiples of the determinant. 

Every k-tensor T gives rise to an antisymmetric one by antisymmetrization: 


i 
AT(vi,.--, Ve) = A S- sgn(m)T (Va(1),++ +5 Vr(k)): 


The tensor product of two tensors T and S is defined simply as the product of 
functions, with disjoint sets of variables: 


(ToS)(vi,...,Va,W1,---,Wr) =T(v1,..-,Va)S(wi,..., Wa). 
For two antisymmetric tensors, this product is not antisymmetric, but it can be 
made antisymmetric; this way we define their exterior product 
TAS =A(ToS). 
Every vector u € R¢ can be considered as 1-tensor, since it defines a linear map 


x +> u!x, which is trivially antisymmetric. We can define the exterior product of 
two vectors: 


1 
uAv = a\uov vou), 
or indeed, any number of vectors: 


u,A---Auy, = A(uyo---oug). 
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Antisymmetric k-tensors of the form u,/---A ug generate the space A*V; in fact, 


it suffices to restrict u,,...,ug, to elements of any fixed basis. We consider the 
tensors e;, \---Aei;, (1 < i: < ig < +++ < ig < d) as the standard basis in AV. 
It is not hard to see that uj A---Au, 4 0 if and only if u,,...,u, are linearly 
independent. 


From the dimension formula it follows that A"V & A¢-*V. In fact, we can 
consider a tensor v; \---Avg_x as a linear functional on A*V acting as 


(vi A---Ava—x) [U1 A-+: Aug] = det(uy,...,Ux,V1,---,; Va—k) 


(extending linearly to A*V and A¢-V). This gives an explicit identification be- 
tween A’V and A¢-FV. If u,v € R?, then uAv € A?R® & R®, and so uAv can be 
considered as a vector in R*. It is easy to check that this vector is just the cross 
product ux v. 


If uj,...,Uz and vj,...,Vvx% are two sets of linearly independent vectors, then 
u,A---Augx and v, A---Avz are nonzero scalar multiples of each other if and only 
if lin(wi,...,ux) = lin(vi,...,v%). So if U is a linear subspace of R¢, then the 


exterior product of a basis of U is an encoding of the subspace that is (almost, up 
to a nonzero scalar) independent of the basis selected, and we will denote it by Ty. 
If U and W are two linear subspaces, then Ty A\Ty 4 0 if and only if UNV = 0; in 
this case, Ty ATy = Tyiv. 

The following identity about determinants, called the Binet-Cauchy Formula, 
will be useful. For a matrix A and subset S of its rows, we denote by Ag the 
submatrix consisting of the rows in S. Let A,B € R"*™, where m < n. Then 


(A.5) det(A"B) = S > det(ASBs). 
se(I"]) 


m 


APPENDIX B 


Graphs 


B.1. Basics 


A graph, unless otherwise emphasized, is a simple graph, i.e., it has no loops 
(edges connecting a node to itself) or parallel edges. We use the word node for the 
node of a graph, the word vertex for the vertex of a polytope, and the word point 
for points in the plane or in other spaces. If we need multiple edges, we use the 
word multigraph. If we need loops, we say this explicitly. 

For the graph denoted by G, we denote its node set by V and its edge set by 
FE, and the numbers of nodes and edges of G by n and m, respectively. Unless 
we state otherwise, we tacitly assume that V = [n] = {1,...,n}. For S C V, we 
denote by G[S] = (5, E[S]) the subgraph induced by S. For a simple graph G, we 
denote by G = (V, E) the complementary graph, often called the complement of G. 
The number of connected components of a graph G will be denoted by c(G). 

A set of mutually adjacent nodes is called a clique, and a set of mutually 
non-adjacent nodes, a stable set. Nonadjacent nodes are often called “indepen- 
dent”, and so a stable set of nodes is called an “independent set”; but this word 
is overloaded even in the context of this book, since we have to talk about “lin- 
early independent vectors”, “algebraically independent numbers”, and also about 
“independent events”. 

We denote the neighborhood of a node v in a G (the set of nodes adjacent to it) 
by Ne@(v), or in the case when the graph is understood, by N(v). The degree of the 
node v, the number of edges with an endpoint in v, will be denoted by deg(v). For 
a simple graph, deg(v) = |N(v)|. Two nodes of a graph are twins, if they have the 
same neighborhoods. We say that a node w distinguishes u and v, if is connected 
to one but not to the other. So two nodes are twins if and only if they are not 
distinguished by any node. If no loops are allowed (most of the time in this book), 
then twin nodes must be nonadjacent. 

The linegraph L(G) of G is defined on V(L(G)) = E, where to edges of G are 
adjacent in L(G) if and only if they have a node in common. 

We have to introduce several basic graph parameters that will be used through- 
out this book. 


w(G): clique number, the number of nodes of the largest clique; 
a(G): stability number, the number of nodes of the largest stable set; 
x(G): chromatic number, the minimum number of stable sets into which 


the node set can be partitioned. 


Clearly the stability number of a graph is the clique number of its complement; 


we do not introduce a separate expression for the chromatic number x(G) of the 
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complement. We define a metric dg on V, where dg(u, v) is the minimum length 
of path in G connecting u and v. 

For X C V, we denote by N(X) the set of neighbors of X, i.e., the set of nodes 
j € V for which there is a node i € V such that ij € E. Warning: N(X) may 
intersect X (if X is not a stable set). We denote by N the neighborhood in the 
complementary graph: N(i) = V\ N(i)\i. 

Let G = (V, £) be a directed graph. For an edge (arc) e, we denote by hd(e) 
and tl(e) the head and the tail of e. 


B.2. Flows and cuts 


Let G be a directed graph and s,t € V. A vector z € RF is called an s-t flow, 
if the flow into any node other than s or t¢ is equal to the flow out of this node: 


> = > Be (ve V\{s,t}). 


e: hd(e)=v e: tl(e)=v 
It is easy to check that 
Dd, %- DL te DL te DL te 
e:hd(e)=t e:tl(e)=t ~—e: tI(e)=s e: hd(e)=s 
the common value w of the two sides is the value of the flow. All these equations 
can be written compactly as B'z = w(1,;—1,). 

From the rich theory of flows, we only recall its central result, the Max-Flow- 
Min-Cut Theorem |Ford—Fulkerson 1962] (see e.g. |Schrijver 2003) for much more). 
An s-t cut (s,¢ € V) is defined by a set of nodes U with s € U but t ¢ U, as the 
set of edges e € E with tl(e) € U but hd(e) ¢ U. Every edge e of the graph has a 
capacity Ce > 0 assigned to it. A flow z is feasible, if 0 < z- < ce for every edge e. 
The capacity of a cut C is the sum of capacities of its edges. 


Theorem B.1 (Max-Flow-Min-Cut Theorem). The maximum value of a fea- 
sible s-t flow is equal to the minimum capacity of an s-t cut. 


A fundamental theorem in graph theory, Menger’s Theorem is closely related to 
flow theory; it can be derived from the Max-Flow-Min-Cut Theorem. For X,Y C V, 
we denote by K(X,Y) connectivity between X and Y, defined as the minimum 
number of nodes that cover all X-Y paths in G (paths connecting a node in X to 
a node in Y). 


Theorem B.2 (Menger’s Theorem I). «(X,Y) the maximum number of node 
disjoint X-Y paths. 


(If X and Y are not disjoint, some of these paths may consist of a single node.) 

A graph G is k-connected if and only if n > k and «(X,Y) = k for any two 
k-subsets X and Y. The largest k for which this holds is the node-connectivity of G, 
denoted «(G). The complete graph K,, is (n—1)-connected but not n-connected. 
A version of Menger’s Theorem characterizes node connectivity: 


Theorem B.3 (Menger’s Theorem II). A simple graph G is k-connected if and 
only ifn > k and any two nodes are connected by k paths that are disjoint except 
for their endpoints. 


Further versions of Menger’s Theorem concern edge-disjoint paths and sepa- 
rating sets by edges; for these, we refer to any basic graph theory book. 
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B.3. Matrices associated with graphs 


Let G be a (finite, undirected, simple) graph with node set V(G) = [n]. The 
adjacency matrix of G is be defined as the nx n matrix Ag = A = (Aj;) in which 


Nese 1, if 7 and 7 are adjacent, 
1 )0, otherwise. 


(We suppress the subscript G if the graph is understood.) 

We can extend this definition to the case when G has multiple edges: we just 
let Aj; be the number of edges connecting i and 7. We can also have weights on 
the edges, in which case we let Aj; be the weight of the edge. We could also allow 
loops and include this information in the diagonal. 

The Laplacian of the graph is defined as the Vx V matrix Lg = L = (£;;) in 
which 

ie ifi=Jj, 
Liz = sigan Joos 
-Ajy, iif. 
In the case of edge-weighted graphs, we define deg(?) = }7; Aij. So L = D—A, 
where D = Dg is the diagonal matrix of the degrees of G. Clearly [1 = 0. The 
normalized Laplacian is defined as L=D7/2Lp-¥2, 

The (generally nonsquare) incidence matrix of G comes in two flavors. For 
an undirected graph G, we define its node-edge incidence matrix M = Mg as the 
ExV matrix for which 


1 if v is an endpoint of e, 
Mev = ‘ 
0 otherwise. 


Often, however, the following matrix is more useful. We consider a directed graph 
G, and let B = Bg denote the E x V matrix for which 


1 if v = hd(e), 
Be =< —-1 ifv =tli(e), 
0 otherwise. 


While Bg depends on the orientation of G, changing the orientation only means 
scaling some rows by —1, which often does not matter much, and we can use B 
for any undirected graph, giving it an arbitrary reference orientation. Column 
by = B41, of B, corresponding to node v, satisfies |b,|? = deg(v). For an edge 
e = uv, the corresponding column of B' is 1,—1,. In topological terms, B is 
the (0-dimensional) coboundary operator. Its transpose B' is the (1-dimensional) 
boundary operator. 

It is easy to check that, independently of the orientation, L = B'B. This 
implies that the Laplacian L is positive semidefinite. It is worth while to express 
this equation in terms of quadratic forms: 

n 
(B.1) x'Lx = (Bx)"(Bx) = SO (ai—2,)’. 
ijCE 

We need some general terminology. Given a graph G, a G-matriz is a symmetric 
matrix M € RY*Y such that M;; = 0 for every edge ij € EH. A G-matrix is well- 
signed, if M;; < 0 for all ij € E. (Note that we have not imposed any condition on 
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the diagonal entries.) So the adjacency matrix A is a G-matrix, and the Laplacian 
L is a well-signed G-matrix. We denote the linear space of G-matrices by Mg. 


B.3.1. Nullspaces and ranges. Let us fix a connected graph G. Every 
vector x in the (right) nullspace of B satisfies x; = x; for every edge ij. So x is 
scalar multiple of 1. The row space of B is the orthogonal complement of this space, 
i.e., the set. of all vectors in RY with coordinates summing to 0. (B.1)) implies that 
the nullspace of L is the same as the nullspace of B, and similar conclusion holds 
for the row space of L. It follows that rk(L) = rk(B) =n-1. 

To determine the column space of B (equivalently, the range of B'), notice 
that (Bx)e = Lha(e) — te) for x € RY. A vector on the (oriented) edges arising as 
differences of a vector on the nodes is called a potential. Potentials form a linear 
space A = Ag = Rng(B), with dim(A) = rk(B) = n—1. (See Section |4.1|for more 
on potentials.) 

The nullspace of B' is more interesting. A vector z € R® is called a circulation 
if B'z = 0. Explicitly, this means that z satisfies the flow condition (conservation 
condition) at each node: 


(B'z), = S- Ze— >. Ze = 0. 


e: hd(e)=v e:tl(e)=v 


Every cycle C in G defines two circulations, obtained by sending one unit of flow 
around the cycle, in either direction. In other words, the circulation fc¢ defined by 
a cycle C' with an orientation (not necessarily an oriented cycle!) is defined by 


1, if e is a forward edge of C, 
(B.2) (fe)e= 4-1, if e isa backward edge of C, 


0, otherwise. 


Circulations form a linear space A+ C R®, which is generated by the special circu- 
lations fc. It is easy to see that dim(A+) =m-—n-+1. 

Reversing the orientation of an edge corresponds to multiplying the correspond- 
ing coordinate by —1, both for A and A+. Reversing all edges does not change 
these linear spaces. 

The rank, nullspace and range of the adjacency matrix A do not have such a 
nice and easy descriptions (but see Section [9.1). 


B.4. Spectra of graphs 


HKigenvalues of matrices associated with graphs are often closely related to 
their geometric representations, but of course they represent a topic about 
which several monographs have been written 
Colin de Verdiere 1998b}. Here we give a very incomplete survey of some of the 
properties that play a role in our treatment, or at least motivate some of our con- 
siderations. 

To fix notation, let Ay > Ag > +--+: => Ay be the eigenvalues of the adjacency 
matrix A of a graph G, let wy < pe <--- < pn be the eigenvalues of the Laplacian 
LD, and let yp < v2 < +--+ < vw be the eigenvalues of the normalized Laplacian 
L=D~'/2LD~-1/2, For a d-regular graph, we have pi; = d—A, and yj = p;/d. 
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B.4.1. The leading eigenvalue. The Perron—Frobenius Theorem implies 
immediately that if G is connected, then A; has multiplicity 1. This eigenvalue 
is relatively uninteresting, it is a kind of “average degree”. More precisely, let d, 
dmax and dyin denote the average, maximum and minimum degree, respectively. 
Then the following inequalities hold: 


(B.3) meee. 4) dca) SAS nas 


In particular, A; = d for every d-regular graph. Inequalities (B.3) imply that 
dmin < A1, which in turn implies by induction the following bound on the chromatic 
number: 


(B.4) x(G) < A, +1. 
Hence 
(B.5) w(G) <A, +1, 


which is of course weaker than (B.4), but it is useful as a starting point of general- 
izations in Chapter [11] 

The smallest eigenvalue of the Laplacian Lg is really uninteresting, since 
Ly = 0. It follows from that the Laplacian Lg is singular and positive semi- 
definite. Furthermore, the multiplicity of 0 as an eigenvalue is equal to the number 
of connected components of G. Similar statement is not true for the adjacency 
matrix (if the largest eigenvalues of the connected components of G are different, 
then the largest eigenvalue of the whole graph has multiplicity 1). This illustrates 
the phenomenon that the Laplacian is often better behaved algebraically than the 
adjacency matrix. 


B.4.2. The smallest eigenvalue. It is a nice elementary fact that a graph 
is bipartite if and only if its spectrum is symmetric about the origin. For connected 
graphs, it suffices to look at the largest and smallest eigenvalues: A connected graph 
G is bipartite if and only if An(G) = —A1(G). 

For chromatic number larger than 2, no similar full characterization in terms of 
eigenvalues can be given; but at least the necessity part of the result above can be 
generalized [Hoffman 1970]. The ratio between the largest and smallest eigenvalues 
can be used to give a lower bound on the chromatic number in terms of the spectrum 
(recall that provides an upper bound): 


At 
(B.6) x(@) 21- >. 


The smallest eigenvalue is closely related to the characterization of linegraphs. 
It was noted in that the smallest eigenvalue of the linegraph of a 
simple graph is at least —2. This property “almost” characterizes linegraphs. It 
was proved in that if G is a simple graph on at least 37 nodes, 
then A,(G) > —2 if and only if G is obtained from the line-graph L(H) by deleting 
a set of disjoint edges from each clique of L(H) corresponding to the edges of H 
connecting a node to nodes of degree 1. 


B.4.3. The eigenvalue gap. The difference between the second and the first 
eigenvalues (the “eigenvalue gap”) is an extremely important parameter in many 
branches of mathematics, and graph theory is no exception. We start with dis- 
cussing the Laplacian LZ of a connected graph G. The smallest eigenvalue of the 
Laplacian, which is zero, has multiplicity 1. It is often important to bound the 
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next smallest eigenvalue Az from below (and sometimes, from above). This is of- 
ten called the Fiedler value of the graph. There are many good upper and lower 
bounds for Az in terms of various graph parameters; for us, it will suffice to know 
that Az > n~? (see for a survey). In Section [5.4] we illustrate the use 
of geometric methods by proving an upper bound on the Fiedler value of planar 
graphs. 

Since the smallest and the next smallest eigenvalues of D are equal if and only 
if the graph is disconnected, we can expect that the gap between them is some kind 
of connectivity measure of the graph. Indeed, fundamental results relate the eigen- 
value gap to expansion (isoperimetric) properties of graphs. These results (which 
have many related, but not equivalent, versions) can be considered as discrete ana- 
logues of Cheeger’s inequality in differential geometry. 

We illustrate this connection by two spectral estimates of the node-expansion 
and edge-expansion of a graph. 


Expanders. This highly nontrivial class of graphs (more precisely, class of graph 
sequences) is the key element in many graph-theoretic constructions. An expander 
is a regular graph with small degree in which the number of neighbors of any set 
containing at most half of the nodes is at least a constant factor of its size. To 
be precise, the node-expansion xp(G) of a graph G is defined as the largest ¢ such 
that |N(S)\S| > e|S| for every set S Cc V with |S| < n/2. An (a, b)-expander is an 
a-regular graph with xp(G) > b. 

Expanders play an important role in many applications of graph theory, in 
particular in computer science. The most important expanders are d-regular ex- 
panders, where d > 3 is a small constant. Such graphs are not easy to construct. 
One method is to do a random construction: for example, we can pick d random per- 
fect matchings on 2n nodes (independently, uniformly over all perfect matchings), 
and let G be the union of them. Deterministic constructions are much more difficult 
to obtain, and are typically based on deep algebraic facts. The first construction 
was given in [Margulis 1973}; see for a construction 
that is optimal in some sense. 

The following spectral bounds on the node-expansion of a d-regular graph 
[Alon 1986], play an important role in analyzing the above 


mentioned algebraic constructions: 


1 
(B.7) sqt2 < xv(G) < 3Vfa. 


Since we are considering only regular graphs, these bounds are easily translated to 
the spectrum of the adjacency matrix or of the transition matrix. 

Edge expansion. There is an equally important connection between the eigen- 
value gap of the normalized Laplacian D and a quantity that can be viewed 


as an edge-counting version of the expansion. For every subset S C V, let 
deg(S) = >0,<5 deg(i), and define 


. 2mec(S,V\S) 
® = : 
ci gcscV deg(S') deg(V \ S') 
For a d-regular graph, this can be written as 

_ n ea(S, V\S) 
)= gg ISVS 


a(G 
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The following basic inequality |Jerrum—Sinclair 1989) is a discrete analogue of 
Cheeger’s Inequality in differential geometry: 

&(G)? 
16 
This inequality plays a central role in the theory of rapidly mixing Markov chains, 

which is however not the subject of this book. 


(B.8) < 1-1 < 0G). 


B.4.4. Multiplicity of eigenvalues. Multiplicity of eigenvalues usually cor- 
responds to symmetries in the graph (although the correspondence is not exact). 
It was proved in and that if all eigen- 
values of A are different, then every automorphism of A has order 1 or 2. 

There is also an interesting graph-theoretic property related to the other ex- 
treme case, when the graph has very few distinct eigenvalues. A graph G is called 
strongly regular, if it is regular, and there are two nonnegative integers a and b such 
that for every pair i,7 of nodes the number of common neighbors of i and 7 is 


a, if 2 and J are adjacent, 
b, if¢ andj are nonadjacent. 


Many interesting, highly symmetrical graphs are strongly regular. These graphs 
can be characterized in terms of their spectra: A connected graph G is strongly 
regular if and only if it is regular and Ag has at most 3 different eigenvalues. 

The multiplicity of the second largest eigenvalue is also important; it is dis- 
cussed in Chapter [16] in connection with the Colin de Verdiére number. 


APPENDIX C 


Convex Bodies 


C.1. Polytopes and polyhedra 


A convex body in R® is a closed, bounded convex set that has an interior point. 
The convex hull of a finite set of points in R@ is called a (convex) polytope. The 
intersection of a finite number of halfspaces in R¢ is called a (convex) polyhedron. 
(We'll drop the adjective “convex”, because we never need to talk about nonconvex 
polyhedra.) Every polytope is a polyhedron. A polyhedron is a polytope if and 
only if it is bounded. 

For every polyhedron, there is a unique smallest affine subspace that contains 
it, its affine hull. The dimension of a polyhedron is the dimension of its affine 
hull. A polyhedron [polytope] in R¢ that has dimension d (equivalently, that has 
an interior point) is called a d-polyhedron [d-polytope]. 

A hyperplane # is said to support the polyhedron if it has a point in common 
with the polyhedron and the polyhedron is contained in one of the closed halfspaces 
with boundary H. A face of a polyhedron is its intersection with a supporting 
hyperplane. A face of a polyhedron that has dimension one less than the dimension 
of the polyhedron is called a facet. A face of dimension 0 (i.e., a single point) is 
called a vertex, a face of dimension 1 is called an edge. The vertices and edges of a 
polytope P form a simple graph G'p, which we call the skeleton of the polytope. 

Every face of a polyhedron [polytope] is a polyhedron [polytope]. Every vertex 
of a face is a vertex of the polyhedron. Every polytope has a finite number of faces. 
Every polytope is the convex hull of its vertices. The set of vertices is the unique 
minimal finite set of points whose convex hull is the polytope. 

Every facet of a d-polyhedron P spans a (unique) hyperplane, and this hy- 
perplane is the boundary of a uniquely determined halfspace that contains the 
polyhedron. The polyhedron is the intersection of the halfspaces determined by its 
facets this way. 

A convex cone in R” is a set of vectors is closed under sum and multiplication 
by positive scalars. Note that according to this definition, the set R” is a convex 
cone. We call the (topologically) closed cone pointed, if the origin is a vertex of 
it; equivalently, if it does not contain a line. Any system of homogeneous linear 
inequalities 


tle 20, oe SO 


defines a convex cone; convex cones defined by such (finite) systems are called 
polyhedral. 

Let P be a polytope in R@, let a € R%, and let u be a vertex of P. Suppose 
that P has a vertex v such that a'u < a!v. Then P has a vertex w such that uw 
is an edge of P, and a'u < a'w. 
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Another way of formulating this is that if we consider the linear objective 
function a'x on a polytope P, then from any vertex we can walk on the skeleton 
to a vertex that maximizes the objective function so that the value of the objective 
function increases at every step. This important fact is the basis for the Simplex 
Method. For our purposes, the following corollary this fact will be important as well: 
If G is the skeleton of a d-polytope, H is an (open or closed) halfspace containing 
an interior point of the polytope, then the subgraph of G induced by those vertices 
of P contained in this halfspace is connected. 


C.2. Polyhedral combinatorics 


As mentioned in the Preface, many applications of geometry in combinatorics 
make use of polyhedra defined by combinatorial structures. We cannot develop this 
large theory here, but we need some of these polyhedra. 

The Stable Set Polytope of a graph is an important example studied in Section 
11.4.2| Here we describe a couple of other families related to flows and cuts only. 
We formulate these examples for directed graphs, but we can get the corresponding 
definitions and facts for undirected graphs by replacing every undirected edge by 
two, oppositely directed edges. 

A few special properties of polytopes (more generally, of convex sets) are often 
encountered in combinatorial situations. A convex set K C R? is called ascending, 
ifx € K, y € R¢ and y > x implies that y € K. For any convex body K, we define 
its dominant as 

RK ={yeR?: ave K, x<y}. 
This dominant is an ascending convex set. A convex set K C R? is called a convex 
corner, if kK C Re, andxe Kk,yeé Re and y < x implies that y € K. A polytope 
which is a convex corner will be called a corner polytope. 


C.2.1. Flows and cuts. In the following examples of particularly important 
combinatorial polyhedra, we fix a directed graph G and two nodes s,t € V. While 
there will be perhaps more versions than one would like to memorize, these are 
often related to each other in simple ways (containment, domination). We consider 
directed graphs, but corresponding notions and facts for undirected graphs can be 
obtained by replacing each edge by a pair of oppositely oriented edges. 

We start with the path polytope, which is a polytope in R¢, defined as the 
convex hull of indicator vectors of edge-sets of directed s-t paths. Every vector in 
the path polytope is an s-t flow of value 1. If we add circulations to these flows, 
we get all s-t flows of value 1. Such flows form the 1-valued flow polyhedron (this is 
not a polytope, since circulations can be unbounded). 

The flow polyhedron is the set of all s-t-flows. The 1-valued flow polyhedron is 
the intersection of it with a hyperplane. 

Often one considers the feasible flow polytope: every edge e has a capacity 
Ce = 0, and we impose the conditions x, < c, on the flow values x,. Those capacity 
assignments that allow a feasible flow of value 1 also form a convex polyhedron; by 
the Max-Flow-Min-Cut Theorem, this is described by the inequalities 


Ce = 0, 


S- Ce > 1 (for every s-t cut C). 
e€C 
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This polyhedron is just the dominant of the path polytope, and we call it the path 
polyhedron. 

We can also work on the node set of G. The node-path polytope is the convex 
hull of indicator vectors of interior nodes of directed s-t paths. This is a poly- 
tope in RY \{* +}, Its dominant, the node-path polyhedron can be described by the 
inequalities 


Ly 20, 


S- Ly >1 (S CV separates s and f). 
ves 


Closely related to the above constructions is another class of polyhedra. The 
cut polytope is the convex hull of indicator vectors of directed s-¢ cuts. Its dominant, 
the cut polyhedron consists of all assignments of nonnegative “lengths” x, to the 
edges for which the length of every s-t path is at least 1; formally, 


Le > 0, 


S- Le>1 (for every s-t path P). 
e€ E(P) 


Again, we can consider the version for nodes. The node-cut polytope, defined in 
RV \{s-4t, is the convex hull of indicator vectors of s-t node-cuts (minimal subsets of 
V \{s,t} intersecting every s-t path). The node-cut polyhedron is the dominant of 
the node-cut polytope. The node-cut polyhedron can be described by the inequal- 
ities 
Ly 20, 


>. Ly >1 (S C V separates s and f£). 
ves 


C.3. Polarity and variations 


C.3.1. Polarity. Let K be a convex body containing the origin as an interior 
point. The polar of K is defined by 


K* ={xeR?: x'y <1 Vy eK}. 


It is clear that K* is a convex body as well, containing the origin in its interior. 
For every convex body K we have (K*)* = K. 

The polar of a polytope (containing the origin) is a polytope. Platonic bodies 
in the 3-dimensional space come in polar pairs, except that the regular tetrahedron 
is congruent to its own polar (Figure |C.1). 

For every vertex v of a polytope P (in any dimension), the inequality v'x < 1 
defines a facet of P*, and vice versa. The vector v is a normal vector of the facet 
v'x <1. More generally, if vo,...,Vm are the vertices of a k-dimensional face F 
of P, then 

Pl otee PY: vixeil)..vizat 
defines a d—k—1-dimensional face of P*. Furthermore, (F+)+ = F. (See Figure 
C3) 

We can, at least formally, define K* for more general convex sets. If K is 
convex cone with its apex at the origin, then the condition x'y < 1 for all ye K 
implies that x'y < 0. So the polar of a convex cone is a convex cone. It is often 


416 Cc. APPENDIX 


FIGURE C.1. The 5 platonic bodies, in polar pairs (the tetrahe- 
dron is its own polar). 


a alx<i 


FIGURE C.2. A 2-dimensional polytope and its polar. 


more convenient to work with the negative of this, called the dual cone, defined as 
{x € R" : 2'y > 0 Vy € K}. This is again a convex cone. If K is closed (in the 
topological sense), then applying either duality or polarity twice, we get K back. 

The nonnegative orthant is self-dual, and so is the cone of positive semidefinite 
matrices. If K is a linear subspace, then its polar (or dual) is the orthogonal 
subspace K+. 

C.3.2. Blocking. There are two constructions similar to polarity that concern 
convex bodies not containing the origin in their interior; rather, contained in the 
nonnegative orthant. 

The blocker of an ascending convex set K C R4 is defined by 


k= {xe Ri: x'y > lvye K}. 


(see Figure |C.3p. 


ry 


FIGURE C.3. A 2-dimensional ascending polytope and its blocker. 
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The blocker of an ascending convex set is a closed ascending convex set. The 
blocker of an ascending polyhedron is an ascending polyhedron. For every ascending 
closed convex set K, we have (KP!) = kK. 

The correspondence between faces of a polyhedron P and P?! is a bit more 
complicated than for polarity, and we describe the relationship between vertices 
and facets only. Every vertex v of P gives rise to a facet of P®!, determined by 
the supporting hyperplane v'x > 1. This construction gives all the facets of P", 
except possibly those corresponding to the nonnegativity constraints x; > 0, which 
may or may not define facets. 


Example C.1. Fixing two nodes s and ¢ of a graph G, the blocker of the path 
polyhedron is the cut polyhedron. The blocker of the node-path polyhedron is the 
node-cut polyhedron. 4 


It is easy to see that every ascending convex body K has a unique point x 
which is closest to the origin. (This point has combinatorial significance in several 
cases.) It is not hard to prove that this point is closely related to the analogous 
point of the blocker. 


Proposition C.2. Let kK C R¢ be an ascending convex set, and let x € K min- 
imize the objective function |x|?.. Then y = (1/|x|?)x is in the blocker K*!, and 


it minimizes the objective function |y|? over K®'. It follows in particular that 
d(0, P) d(0, P®') = 1. 


C.3.3. Antiblocking. The antiblocker of a convex corner K is defined by 
K*! = {xeRi: x'y < lvye K} 


(see Figure [C.4p. 


b'x<1 


FIGURE C.4. A 2-dimensional convex corner and its antiblocker 


The antiblocker of a convex corner is a convex corner. For every convex corner 
K, we have (K>!)abl = Kk, 

If the convex corner P is a polytope, then so is its antiblocker P*>!. The 
correspondence between faces of P and P*»! is again a bit more complicated than 
for the polars. The nonnegativity constraints x; > 0 always define facets, and they 
do not correspond to vertices in the antiblocker. All other facets of P correspond 
to vertices of P®>!. Not every vertex of P defines a facet in P?>!. The origin is a 
trivial exceptional vertex, but there may be further exceptional vertices. A vertex 
of P defines a facet of P* if and only if there is no other vertex w such that v < w. 
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C.4. Balls, spheres and caps 


C.4.1. Circle geometry. Linear, affine and projective geometries are built 
on (straight) lines, planes and flats. There is another beautiful piece of geometry 
built on circles and spheres. We are going to need some of its basic facts, which are 
summarized here. 

For a radius r > 0, a ball is the set of points at a distance at most r from a 
given point x called its center. Its boundary is a (d—1)-sphere. The unit ball in 
R?@ is the set B¢ = {x € R¢: |x| < 1}, and the unit sphere is the set S4~! = {x € 
R?: |x| = 1}. We can talk about a lower dimensional ball or sphere in R¢: this is 
a ball or sphere in an affine subspace. The intersection of a sphere with an affine 
subspace or with another sphere is either a sphere, or a point, or empty. 

The intersection of a sphere S with a closed affine halfspace H, assuming it is 
not empty, not the whole sphere, and not a single point, is called a cap. The cap is 
proper if H does not contain the center of the sphere. 

For a cap C on the sphere S, there are three possible points that can be thought 
of as its “center”. Let us assume, for simplicity, that S = S¢~', and we are working 
in R¢. The line @ through the center of S$ orthogonal to the halfspace H intersects 
the sphere at two points, one of which belongs to the cap; we call this the center of 
the cap. The line @ intersects the boundary of H at a point, which we call the inner 
center of the cap. Finally, assuming that the cap is proper, @ contains a point from 
which the visible part of the sphere is exactly C’; we call this point the pole of the 
cap. 


Inversion. Also called “reflection in a sphere”, inversion is an involution of the 
d-space. To be more precise, inversion with respect to the a (d—1)-dimensional 
sphere S$ C R?@ centered at the origin and with radius r is defined as the map 

ye 
(C.1) xh ae 
Inversion with respect to a sphere with another center can be defined by shifting 
the coordinate system accordingly. This definition is valid in every dimension, but 
we will use it mostly in dimension 2. 

The points of the sphere are fixed by this transformation, and interior points 
are mapped onto exterior points and vice versa. For the origin, inversion is not 
defined by (C1). We can say that this involution is a transformation of R®\ {0}, 
or—often better—we can add a “point at infinity”, denoted by oo, and define 0 
and co to be images of each other. The inner center and pole of a proper cap are 
inverses of each other (Figure |C.5). 


FIGURE C.5. Left: A pair of inverse points as the center and pole 
of a cap on a 1-dimensional sphere. Right: The inverse of a circle. 
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Perhaps the most important property of involution is that it is sphere- 
preserving. More precisely, inversion preserves the set of (d—1)-dimensional spheres 
and hyperplanes (we can consider hyperplanes as degenerate spheres). Hyperplanes 
through the origin are mapped onto themselves. Hyperplanes not going through the 
origin are mapped onto spheres going through the origin, and vice versa. Spheres 
not going through the origin are mapped onto spheres not going through the origin. 

Inversion also preserves angles between two smooth curves intersecting each 
other, in particular it preserves angles between intersecting circles and/or lines. 


Stereographic projection. Let S be a sphere in the d-dimensional space, and let 
a € S be any point on the sphere, and let b be its antipode. Let H be a hyperplane 
orthogonal to the diagonal a—b, not going through ab. Projecting the sphere from 
ab onto H is called stereographic projection. While this definition is important in 
every dimension, we will use it mostly in dimension 3. Sometimes we call a the 
“North pole”, and its antipode, the “South pole” (Figure |C.6). 

The choice of H is not very important: replacing it by another, parallel hyper- 
plane results only in a homothetical transformation in the plane. The plane H is 
sometimes chosen to be tangent to S' at the South pole, sometimes passing through 
the center of the sphere; the former choice has the advantage that it is almost an 
isometry near the South pole; the latter, that it leaves the equator invariant. The 
inverse stereographic projection, projecting the plane onto the sphere from a to S, 
is also used here and there in this book. 


FIGURE C.6. Stereographic projection of a circle. 


A little computation shows that (assuming H is the hyperplane tangent to S 
at the South pole) stereographic projection can be thought of as inversion with 
respect to the sphere centered at the North pole and going through the South pole. 
This implies that stereographic projection and its inverse map circles and lines onto 
circles and/or lines, and preserve the angle between them. 


Mobius transformations. Let us restrict our attention to a 2-sphere S. We 
consider bijective homeomorphisms of S onto itself that map circles onto circles; we 
call such a map circle-preserving. A trivial example is the reflection of the sphere in 
a plane through its center; this reverses the orientation of circles. Circle-preserving 
maps that also preserve the orientation are called Mobius transformations. It is 
easy to see that every circle-preserving orientation-reversing map can be obtained 
as a Mobius transformation followed by a reflection. 
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These transformations of a sphere S can be described in various ways. Mobius 
transformations are exactly those maps of the form 
az+b ae a 
rae qe Os ca ars. 
(where a,b,c,d are complex numbers such that ad—bc 4 0, and the sphere S' is 
identified with CU{oo}). Circle-preserving transformations of S' can be obtained 
from projective transformations of the projective 3-space keeping S invariant, by 
restricting them to S. We can also apply a stereographic projection of S to a plane 
not necessarily tangent to S (keeping in mind that the North pole is mapped onto 
an additional point co), move the plane to some other position in space, and project 
it back. Inversion with respect to another sphere S’ orthogonal to S, restricted to S, 
is a circle preserving, but orientation reversing transformation of S, and every circle 
preserving transformation of S can be decomposed as a product of such inversions. 
Circle-preserving transformations of S$? form a group M, of which Mobius 
transformations form a subgroup M of index 2. Another subgroup is the group 
O(3) of orthogonal transformations (isometries of the sphere). We will need the 
fact that every finite subgroup of M is conjugate to a subgroup of O(3). 


C.4.2. Volume. We denote by 7g the volume of the d-dimensional unit ball 
B?, It is known that 


k 
us : 
/2 kl if d = 2k, 
(C.2) fd = sa ay = ae 
'(1+5) ee wae ohaa, 
(2k +1)!!! 
For both the even and odd case, Stirling’s Formula gives the asymptotics 
1 /2en, 4/2 
03 ae 
C28) 0 Vand\ d 


(It is funny that both classical transcendental numbers e and 7 occur together 
here.) The surface area of this ball (the (d—1)-dimensional measure of the sphere 
So?) 4s dig: 

A cap on this ball can be described as the set Cu,, = {v1 € S41: u?zx > s}, 
where u € S¢~! is the center of the cap, and —1 < s < 1. We will need good 
bounds on the (d—1)-dimensional surface area of this cap; it suffices to restrict 
our attention to the case when s > 0. In dimension d = 3, the area of this cap 
is 2x(1—s). If the spherical radius of this cap is p, then this area can also be 
expressed as 47 sin?(p/2). 

In higher dimension, the surface area cannot be expressed so explicitly. It is 
easy to derive the integral formula: 


1 
(C.4) vOln—1(Cu,s) = (n—1)tm-1 f (1—a:?)("-3)/2 dy, 


but it is hard to bring this into a more manageable form. Instead of explicit 
formulas, the following estimates will be more useful for us: 


Tn-1 


2 \ n= 2 
(C5) (1 ) (1 = 5?)-?? < voln1(Cu,e) S 


1 52)(n-0)/2. 
s/n—1 8 8 as) 
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(The lower bound is meaningful for s > 2/./n—1.) The upper bound follows from 
the estimate 
1 


1 
fo-ayeo dx < =f ea-2yr-9" dz = 
s 


Ss Ss 


ag, 


n—1)s 


The lower bound uses a similar argument with a twist. Let s << t <1, then 
1 t t 


fo-ayeror dx > fo-syeor dx > 7 fea-2ye-9r dx 


1 ; 
—~_-_ (4 g2)@-D/2_(4_ #2 eae 
aaa (1-3?) (1-1?) 
Optimizing for t, we get the estimate with a somewhat tedious but routine compu- 
tation. 


C.5. Volume of convex bodies 


C.5.1. Classical inequalities. We start with a simple formula. Let K be a 
convex body in R” containing the origin in its interior, and for every unit vector v, 
let hx (v) denote the length of the segment in K of the semiline determined by v. 
Choosing v randomly and uniformly from $”~', we have 


(C.6) vol(K) = m,E(hx(v)”). 

Let Ky,...,Kq be convex bodies in R¢. It is not hard to see that the function 
f(a1,.--,@a) = vol(a, kK, +---+a Kg) is a homogeneous polynomial of degree d in 
the variables 71,...,2, > 0. The coefficient of 7, ...2g in this polynomial, divided 


by d!, is called the mized volume of K1,..., Ka, denoted by V(ky,..., Ka). 

This quantity has many deep and useful properties. It is easy to see that 
V(K,...,K) = vol(K). It is less trivial that the mixed volume is nonnegative. A 
deep inequality concerning mixed volume is the Alexandrov—Fenchel Inequality: 


(C.7) V(Ky, Ki, K3,...,Ka)V (Ko, Ko, K3,..., Ka) < V(Ki, Ko, K3,..., Ka)’. 


One can derive the Brunn—Minkowski Theorem from the Alexandrov—Fenchel 
Inequality (but it is easier to prove): 


For any two convex bodies A,B C R4, the value vol(tA+(1—t)B)'/¢ is a con- 
cave function of0O <t< 1. 

We need a consequence of this theorem. For a convex body K, we define its 
difference body K-—K ={x-y: x,y € K}. It is trivial that vol(K — K) > vol(K) 
(since K —K contains a translated copy of kK). The Brunn—Minkowski Theorem 
implies a much sharper bound: 


(C.8) vol(K —K) = avol(SK + 5(-K)) > 2¢vol(K). 


The volumes of a convex body and its polar are closely related. For every 
0-symmetric convex body K C R¢, 


d 
(C.9) a < vol(K)vol(K*) < 72. 


Using (C.2), it is easy to see that the upper bound is less than (27)“/d!, which is at- 
tained when both K and K™ are balls; this was proved for d < 3 by |Blaschke 1917 
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and for all dimensions by |Santalé 1949]. The lower bound (with a smaller constant 


instead of 7) was proved by [Bourgain—Milman 1987|. The currently known best 
bound above was proved by |Kuperberg 2008]. Mahler’s unsettled conjecture states 


that the minimum of this product over n-dimensional 0-symmetric convex bodies is 
attained when K is a cube and K™* is a cross-polytope (or the other way around), 
which would give a lower bound of 4¢/d!. 


C.5.2. Projection. We conclude with some inequalities concerning the length 
of projections in high dimensions; while these are well known and often used, it does 
not seem easy to find their proofs, and so I include proofs for several of them. 

For a set K C R” and linear subspace LZ C R”, let Kz denote the orthogonal 
projection of K onto L. In the case when L is a one-dimensional subspace generated 
by a vector v, then we also use the notation Ky, = Ky. 

The question about the area of caps can be generalized as follows: given a 
number 0 < c < 1, what is the surface area of $”~' whose projection onto the 
subspace formed by the first d coordinates has length at most c? If d = n—1, 
then this set consists of two congruent caps, but if d < n, then the set is more 
complicated. The cases c < ,/d/n and c > ,/d/n are quite different. We only need 
the first one in this book. 

Let x’ and x” denote the projections of a vector x onto the first d coordinates 
and onto the last n—d coordinates, respectively. It is easy to see that if u is chosen 
randomly, uniformly from $”~', then 

d 
(C.10) E(ju’|?) = —. 

n 
Let us assume that d < n—2 and (for the time being) c < 1. We want to find good 
estimates for the (n—1)-dimensional measure of the set 


A=ixes""< |< al. 


Define ¢(x) = x’+(x”)°, then ¢ maps A onto A’ = (sB4)x $"-4! bijectively. 
It is not hard to compute that the Jacobian of ¢~' (as a map A’ > A) is (1— 
|x’ |2)(r-4-2)/2 < 1, and hence 


(C.11) volyn—1(A) < voln_1(A’) = s4(n—d)tatn—a- 


(For n = d+2 the Jacobian is identically 1, whence vol, _;(A) = vol,_1(A’), which 
implies, for n = 3, the well-known formula for the surface area of caps mentioned 
before.) 

Another way of formulating this inequality is that choosing a random vector u 
uniformly from $”~!, it bounds the probability that its orthogonal projection to a 
d-dimensional subspace is small: 


(n—d)taTn—a <f 
NT, , 


(C.12) P(|w| < s) < 


Using the asymptotics for factorials, it is easy to derive the slightly weaker but 
more explicit bound 


(C.13) P(|u’| < s) < (2s ie 
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This estimate is interesting when s is smaller than the expectation of |u’|, say 
s = €,/d/n for some 0 < e < 1. Then we get 


(C.14) P(|u' ES 4) < (2e)4, 


To prove tighter bounds on the concentration of |u’| around its expectation, we 
can use an explicit description of generating a uniform random element of S"~!. We 
generate n independent standard Gaussian variables X1,...,X,, and normalize: 


1 
(C.15) a rrn 5 ComeTS A 


f/XP+---+X2 
Here the random variable X?+---+X? is from a chi-squared distribution with 


parameter n, and many estimates are known for its distribution. We need the 
following estimate (see e.g. [Massart_ 2003] ) 


(C.16) P(|X?+-+-4+X2—n| > en) <2e-"/8. 
If we generate a random unit vector u as described above, then 
ju’ |? an Xi +: : eG 
XP He 4X2 


By (C.16), the numerator is concentrated around d, and the denominator is con- 
centrated around n. Hence the length of u’ is concentrated around ,/d/n. To be 
more precise, it follows that for every ¢ > 0, 


(C.17) an la a+e/4 


l+teVn 


with probability at least 1— 4e~ © 4/4, 

Next, we turn to projections of convex bodies. Let kK C R” be a convex body, 
let v € S"—! be chosen randomly from the uniform distribution, and let Ky be 
the projection of K onto the line containing v. Let |K,| denote the length of this 


projection, which is the width of K in direction v. It is easy to check that 


1 
C.18 ae 
(C.18) [Ay | aa) 
Combining with (C.6), we can express the volume of (A — K)*: 
(C.19) vol((K —K)*) = mE(h(x—x)+(v)”) = mE(|Ky|~”). 
By the Blaschke—Santalé Inequality (C.9) and (C-8), this implies 
vol((K — K)*) Tn, Tn, 

‘ v —), = < <= 7 
(e20) E(|Ky|™) Tn, ~ vol(k — Kk) ~ 2°vol(K) 
Markov’s Inequality implies that for every s > 0, 

Tins” 

: v < = v eg = < ——__.. 

(6.21) P(IKy| <8) = P(lKe|"s" > 1) < ste 


This inequality can be extended to the case when K is not full-dimensional, 
but d-dimensional (d < n), and so we have to consider a lower dimensional measure 
of its volume. Let u be the projection of v onto R?. The length |u| and the 
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direction u° are independent random variables. Clearly |Ky| = |u| |Kul, where |u| 
and K, = Kyo are independent as random variables. Then, using (C.12), 
s (n—d) tama f 8 V4 
P(|Ky| <8) = P(|ul < =) < Eye (=3)) 
(IKy| Ss) ful s |Kul7 — Nn | yo| 
(n—d)taTn—a <(E 
NT 
and using (C.20) (noting that the projection K — Kyo happens in dimension d) 
we get 


w0(|Kyo|~“), 


(n— d) 72nd d 


.22 P(|Kky|<s)< 
(C.22) (I |< s) ~ ntp,24vol(K) 
Applying the asymptotic formula for 7,,, we get the more explicit bound 
As\/n\¢ 1 
(owe P(|Ky| <s) < (=) 
a (1 S8) (—F-) saa 
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